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1 Introduction 


Mathematics is “the science of numbers, quantities, and shapes and the rela- 
tions between them”. Its history dates back almost three millennia — indeed, 
many insights found in this book have been known for centuries. The dri- 
ving force behind the development of mathematics may be found in human 
nature: As a species, we seem to be uniquely interested in finding rational 
explanations for regular or repetitive phenomena in order to predict them. 
If not for this curiosity that appears to be innate to our species, many of 
the insights in this book may never have been discovered. In cultural terms, 
it took us a very long time to phrase these explanations in the language of 
science, and thus, to give them actual predictive power. 

Mathematics has been (rather appropriately and beautifully) described as “a 
self-contained microcosm, with a strong potentiality of mirroring and model- 


ling all the processes of thought and perhaps all of science”.? 


Mathematical questions, and the involvement of the computer 


There was a short period when a substantial amount of people believed that 
the computer would eventually cause mathematics to be partially replaced. 
However, mathematics is not merely the calculation of computational steps 
— its core is to be found in the logical thinking behind those steps. 

In other words, the question is not how, but why certain computational steps 
have to be made. Once this is determined, the (often tedious) computation 
itself may actually begin. For this, the computer is truly a blessing, as it 
allows us to outsource the whole burden of routine operations, and to clear 
our minds for the comprehension of the relations from which they derive. 
With a clear mind, it is much easier to tackle more complex problems, and 
to discover — sometimes — that “one may proceed with the same tried and 
tested methodologies”. 


Why is there an upper bound for the size of insects, 

and a lower bound for the size of warm-blooded ani- 
\ mals? The answer follows from an important theo- 
N rem about similar objects. 


thttp://www.merriam-webster.com/dictionary/mathematics 
2Marc Kac, Stanislaw Ulam: Mathematics and Logic, Dover Publ., 1992. 
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When and why does a ship capsize? This is a typical 
question for vector mathematics. 


What implications does Kepler’s Second Law have 
for the seasons? To answer this question, we need to 
employ knowledge about trigonometry and ellipses. 


How does GPS (=“Global Positioning System”) 
work? How many satellites are required to determi- 
ne a spatial position, and how should their orbital 
paths be aligned to cover the Earth’s surface most 
efficiently? When should one expect navigational 
problems? These questions require the techniques 
of analytic geometry. 


Why do rainbows exist? Why does the sun appear 
as a glowing red fireball even after it has, from a 
purely physical point of view, already disappeared 
behind the horizon? In both cases, the answers are 
to be found in the refraction of light, which may be 
investigated using differential calculus. 


What explains the intriguing spirals in sunflowers, 
the impressive shapes of antelope horns, or the be- 
autifully geometric forms of snail shells? To ans- 
wer these questions, we need to employ exponential 
functions. 


How should a lawn sprinkler be moved so that the 
lawn may be irrigated evenly? How much air is con- 
sumed in a multi-level dive? What is the average 
life expectancy of people that have reached a cer- 
tain age? Here, we need to employ integral calculus. 


Why did natural selection “design” the heartbeat 
to be slightly irregular, and why can election re- 
sults be accurately predicted by counting only 10% 
of the votes cast? The “law of large numbers” and 
descriptive statistics may be employed here. 


How were the mathematical proportions of a musi- 
cal instrument’s strings first defined by Pythagoras, 
and how did musical scales and tonal systems evol- 
ve in the history of music? This concerns propor- 
tions that do not only make sense mathematically, 
but that are considered by people to be “harmonic”. 


Last but not least: How does a computer evaluate 
function expressions, and under which circumstan- 


ces should we not rely on its alleged accuracy? 
The answers to many of these questions become especially clear when its 
underlying principles are demonstrated as “animations”. Consider a rocking 
boat, whose motion can be traced from any angle on a computer screen, 
which makes it easy to see which torsional momenta conspire to return it 
to its upright state. Consider a visualization of the orbital acceleration of a 
planet as it avoids being swallowed by its parent star, or a time lapse showing 
the growth of an antelope’s horn. Such animations are easily accomplished 
through modern computers, but the underlying mathematics has been known 
since a much earlier time! 


Guiding principles of the book 


This book traces its origin to the lectures on Applied Mathematics that I have 
been conducting for many years, for the benefit of students of architecture 
and industrial design, at the University of Applied Arts Vienna. I have tried 
to make the topics accessible to a much wider audience — essentially, to all 
people interested in the relations between mathematics and the most diverse 
of disciplines. My aim is to help put existing mathematical knowledge into a 
more structured and practical context. 

In the meantime, parts of this book have already been published, mainly in 
the books 

e G. Glaeser: Der mathematische Werkzeugkasten, Springer Spektrum, Hei- 
delberg, 4th edition 2014. Some applications are also to be found in 

e G. Glaeser: Nature and Numbers, Ambra V/DeGruyter, Vienna, 2014, 

e G. Glaeser: Geometry and Its Applications, Springer New York, 2012, 

e G. Glaeser, K. Polthier: Bilder der Mathematik, Springer Spektrum, Hei- 
delberg, 3’ edition 2014. 

e G. Glaeser, H. Stachel, B. Odehnal: The Universe of Conics. From the 
ancient Greeks to 21st century developments. Springer Spektrum, 2016. 
Additionally, some biological applications can be found in 

e G. Glaeser, H.F. Paulus: The Evolution of the Eye. Springer Nature, Hei- 
delberg/New York 2015, 

e G. Glaeser, H.F. Paulus, W. Nachtigall: The Evolution of Flight. Springer 
Nature, Heidelberg/New York 2017. 

This book is not a treatise on mathematics in the classical sense, in which 
definitions, theorems, and proofs are strung together. Many of the practi- 
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cal examples within encourage interconnections between various, not always 
strictly technical, physical disciplines, such as biology, geography, archeolo- 
gy, medicine, music, applied arts, etc. Their larger purpose is to heighten 
awareness of the interconnecting laws underlying art and nature. 

Thus, I have attempted to create a self-contained reference, aimed not at 
strict mathematicians, but at more pragmatic readers. The following base 
principles were followed whenever possible: 

e Each chapter starts with a concise introduction to the theory, supported 
by various practical examples. Very often, a thorough theoretical education 
is not enough — when solving a particular challenge, one needs to be able to 
recognize the nature of the mathematical problem at its core. 

e | have consciously avoided a strictly mathematical way of phrasing ideas, 
since it tends to distract more casual readers from the essence of the parti- 
cular example. Details can always be elucidated once the relevant essence of 
a theorem has been understood. 

e Proofs are conducted with appropriate rigor. However, I try to equip readers 
with the necessary knowledge to “derive” certain relations purely on the basis 
of common sense. 

e A geometrically insightful sketch is usually preferred to mathematical ab- 
straction, and for this reason, higher-dimensional problems are seldom found 
in this book. 

e The general case — despite being sometimes more elaborate — is usually 
preferred to the special cases and their various subtleties. Thus, more complex 
equations of the form f(a) = 0, or definite integrals he f (x)dzx, are considered 
“solved”, as the results can always be approximated by a computer to an 
arbitrary degree of precision. 

e References to literature are supplemented with relevant web addresses, for 
the very simple reason that access to the Internet has become ubiquitous, 
while access to university libraries is not quite so easy and immediate. 


Structure of the book 


The contents have been reduced to several chapters: 

e First (Chapter 2), the fundamentals of applied mathematics are repeated 
on the basis of practical examples. A particular focus is laid upon algebraic 
equations and systems of linear equations. 

e The third chapter discusses proportions through various examples. Similar 
bodies are of particular interest, and their analysis leads to simple but non- 
trivial conclusions that are enormously influential in nature. 

e The fourth chapter is dedicated to calculations related to right and oblique 
triangles — and thus, to trigonometry. Here, too, there exists a wide scope of 
applications in various fields of inquiry. 

e In the fifth chapter, the topic of vector mathematics is covered — though 
still with the means of elementary mathematics. Vectors play a fundamental 
role in physics, and are key to the application of geometrical problems on 
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the computer. Perhaps for this reason, their prominence and importance 
has grown enormously in recent times. The ease and elegance of the related 
calculations are demonstrated through many practical applications. 

e The sixth chapter deals with the classical real functions and their derivati- 
ves. They are fundamentally important for many calculations and have lost 
much of their terror since the dawn of the computer age. 

e The seventh chapter concerns curves and surfaces. Locus lines and path 
curves of so-called constrained motions are discussed in particular. Here, 
the computer is employed for their visualization. The software used for this 
purpose has been developed at the University of Applied Arts Vienna. The 
website of the book contains dozens of executable demo programs. This en- 
courages geometrical creativity, as well as a sensible use of modern computing 
technology. In working with computers, the results should always be evalua- 
ted with a healthy dose of common sense. 

e The eighth chapter guides the reader through simple and important appli- 
cations of calculus. Complicated evaluations are performed on the computer. 
This affords enough space to promote the comprehension of general equations 
for the calculation of surfaces, centers of mass, etc. 

e The ninth chapter is dedicated to statistics and probability calculus. The 
importance of these topics has grown considerably in recent times. A particu- 
lar emphasis is placed on the analysis of data as a complement to descriptive 
statistics — in other words, on the art of learning from data. 

e Finally, the appendix includes topics that were difficult to place into the 
preceding chapters, such as complex numbers, Fibonacci numbers, and the 
whole subject of music and mathematics. 


The accompanying website www.uni-ak.ac.at/math 


This book is accompanied by its own website, which contains updates, ex- 
tra examples, additional web addresses to the various topics covered in this 
book, as well as dozens of executables that allow you to explore mathema- 
tical relations interactively. In particular, they make it easy to comprehend 
complex motions and physical simulations. 

The advantages of such a website are self-evident: It can always be kept 
up to date and augmented without changing the book at its foundation. 
Readers are encouraged to make frequent use of it. I am always grateful for 
constructive feedback! 


On the exactness of mathematics 


The following “mathematician’s joke” shows the discipline’s characteristic 
self-image: 

An engineer, a philosopher, and a mathematician are riding through the 
Scottish Highlands on a train and notice a black sheep within a herd. The 
engineer says: “I didn’t know there were black sheep in Scotland.” The phi- 
losopher immediately corrects him: “Hold on! This isn’t so easy. You should 
have said: I didn’t know there was at least one black sheep in Scotland.” 
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The mathematician cannot help but object: “Gentlemen, this is still not pre- 
cise enough. You should have said: I didn’t know there was at least one sheep 


oP) 


in Scotland with at least one black side ... 
Pure mathematics certainly concerns itself with degrees of exactness and 
descriptive precision that would be absurd in everyday life. In applying ma- 
thematics to solve practical problems, an engineer may interpret reality so- 
mewhat pragmatically, in order to achieve useful results more quickly. The 
truth may lie somewhere in between ... 


Acknowledgements 


A good book is the result of year-long preparations, based in interaction and 
cooperation with many individuals. 

For their contribution of interesting ideas and/or proof-reading, I would like 
to thank: Reinhard Amon (he contributed substantially to the “musical to- 
pics” which form the theoretical foundation of Appendix B), Franz Gruber 
(he came up with many ideas and several illustrations), Stefan Wirnsperger 
and Markus Roskar (hand-drawn illustrations), and Wilhelm Fuhs. For co- 
operation and correction, particular thanks are also due to my co-workers 
Franz Gruber, Gtinter Wallner, Eugenie Maria Theuer and especially Boris 
Odehnal, and also to the various interested readers that filled my inbox with 
suggestions for improvements, as well as typo sightings. 

Finally, I would like to thank all my students who brought great enthusiasm 
into the classroom, and who contributed substantially to the book through 
constructive questions and discussions. They have never failed to give me the 
impression that mathematics, if taught with excitement and engagement, 
does not have to be the mere concern of an ivory-tower elite, but has the 
capacity to address people from all walks of life. The fear of the “dreaded 
subject” of mathematics, still present in some of us, may one day be replaced 
by an attitude of “curiosity and fascination”. 


2 Equations, systems of equations 


In this chapter, we examine elementary knowledge of mathematical relations 
from a higher point of view. Having understood the elementary relations 
thoroughly, we will conclude that many deceptively complex problems are 
grounded in simpler principles. 

We will repeat the most elementary rules of algebra, such as the laws gover- 
ning power functions, and the rules related to solving linear and quadratic 
equations. 

Linear systems of equations will be particularly relevant to us, and may 
also be of interest to advanced readers, as they are not always trivial and 
often exhibit intricate interrelations with other branches of science, such as 
geography, physics, chemistry, photography, applied arts, and music. We will 
perform many calculations using real physical units. This will enable us to 
stay relevant to the solution of practical problems. 

Due to the high computational complexity involved, we will only deal with 
higher-order algebraic equations conceptually. However, we will still find 
practical applications for their use. 

At the end of the chapter, we will discuss further applications that cross over 
into fields of knowledge that are not exclusively concerned with mathematics. 
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2.1 The fundamentals of numbers and equations 


Calculations with floating point numbers and precision 


In the following section, we will perform calculations almost exclusively with 
real numbers. In general, our pocket calculator shows such numbers as deci- 
mals. Most calculators perform their task with an accuracy of +13-15 digits. 
We have to keep in mind that the numbers we enter into our calculator 
might be subject to imprecision (due to estimation, imprecise measurement, 
or rounding). A definitely correct digit (except zero, if placed in order to fix 
the decimal point) will be called a valid digit. Thus, the number 0.000123 
possesses at most three valid digits. We now have to consider the following 
theorem: 


After a multiplication or a division, the number of valid digits is equal to the 
minimum number of valid digits in all operands. The result of an addition 
or subtraction has no valid digits beyond the last decimal place where both 
operands possess valid digits. One might say that the chain is only as strong 
as its weakest link! 


>>> Application: average velocity 

Consider a simple equation in physics: v = s/t (velocity = distance over time). 
A car drives a distance of 88km in 1h6 min. What is the average velocity, 
and how accurately can it be calculated? 


Solution: 
1h6min is equivalent to 1h+ Sh = 1.1h. 
The above equation yields 


s 88km km 
~#” Tih cae ee 


Fig. 2.1 average velocity 


The distance of the route travelled was taken from a street map, which would 
indicate 88km, even if the actual distance is 87.500km or 88.499km. In a 
similar manner, most people might be satisfied with a time measurement 
that is exact with minute precision. Thus, the actual car might have taken 
1h5 min 30s or 1h6 min 29s, and we still would have entered 1h6 min into 
our calculation. However, the time measurement fluctuates between 

5 30 6 29 


th+—h+ —h=1.0917h and lh+—h+ 
60 3600 60 3600 


h = 1.108h. 
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We can, therefore, conclude that the “accurate result” might be found any- 
where between 


v= ee = 81.065 km/h and v= os = 78.971 km/h. 
1.0917 


1.108 
Therefore, it is correct to say that many decimal places of the calculated 
average velocity are baseless and may lead to misleading and unjustified 
impressions of accuracy! 


® Remark: Physicists tend to perform calculations in units of measurement. The 
velocity — including that of a car — is usually given in m/s, and time in seconds. 


Since x a =2 ee the following frequently employed relation is true: 
km 2m I km 
—=-——— and y—=3.6y — 
h 36s ¢ S I 
The following example demonstrates the risk of using units of measurement care- 


lessly. 
Consider the following — incorrect — chain of equations 


1€ = 100 Cent = 10 Cent - 10 Cent = 0.1€-0.1€ = 0.01€. 


Can you find the mistake hidden therein? ® << 


>>> Application: moderate but constant 

Every driver ought to know that it is unwise to make up for lost time by 
“speeding”. A moderate but constant velocity not only consumes less fuel 
(while also being less stressful), but is also more time-efficient than one might 
imagine. This provides an answer to the question “when to mount rain tires” 
in Formula 1 racing: Is it worth it to “put on” rain tires, which allow the 
vehicle to drive faster on wet surfaces, at the cost of a pit stop? 

Let us consider a simple and very idealized example (Fig. 2.2): A race course 
of s = 12km (where 6km run across easy and 6km across difficult terrain) is 
to be traversed by a runner (at a constant velocity of 12 m) and a mountain 
biker (4 an faster than the runner in simple terrain, and 4 slower than 
the runner in difficult terrain). Which participant takes less time to cross the 
finish line? How long must the trajectory across simple terrain be for both 
participants to finish at the same time? 


Solution: 


Time for the runner t; = 2 = th. 


6km : 6km 
(12+4) 82 © (12-4) 


Time for the mountain biker t2 = = 2h, or 73 minutes 
slower! 
If the mountain biker is to cross the finish line at the same time as the runner, 


then the sum of the partial times for the different terrains must be 1h: 


zkm | (12—a)km 


=lh>a2=8km. 
km km 
es Oar 
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. bea 


Fig. 2.2 the great race 


It follows that the easier terrain should cover two thirds of the entire course. 


® Remark: The “accordion effect” is an interesting phenomenon to note at this point: 
If a line of vehicles travels on the fast lane at a speed of 160 km/h, it may only take 
a single car, “breaking out” at 130km/h from behind a truck, to bring the faster 
vehicles to a full stop: The driver at the front has a certain reaction time, in which 
the velocity of 160 km/h may still be maintained. To avert a collision with the slower 
vehicle, the velocity needs to be reduced, until it is significantly lower than that of 
the obstructing vehicle. However, the same problem arises for the subsequent car in 
the fast lane. Sufficient distances between the cars help to avoid traffic jam scenarios 
of this kind. ® << 


>>> Application: overtaking calculations 

A car (speed v1) drives past another slower moving car (speed v2), travelling 
in the same direction. How much time does it take for the faster car to pass 
the slower one and how much distance must the faster car cover? 


Fig. 2.3 overtaking with dog leash as tape measure 


Solution: 

The speed difference is v1—v. If we assume the length of the slower vehicle to 
be A meters and we should be B meters behind the vehicle before overtaking 
and cut in C’ meters after the vehicle, we have to pace across As = A+ B+ 
C meters with a speed of Av = v1 —v2. The time needed for that is t = As/Av. 
Clearly, the process depends in an indirectly proportional way on the speed 
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difference. From the beginning of the overtaking process to the end, the faster 
car must cover a distance of Stota] = U1 t. 

Numerical example: 

v, = 108 km/h (= 30 m/s), v2 = 90 km/h (= 25 m/s), length of slower vehicle A = 10 
m. 

Good numbers for A and C might be 15 m. Then, we have As = 40 m, t= 8 s and 
Stotal = 30m/s- 8s = 240 m. PPP 


>>> Application: making up ground 

Two joggers A and B run together at a speed of 3.5 m/s. Then, A decides to 
run back, whereas B intends to continue for another 300 m, and then, turn 
around. He, therefore, immediately speeds up to 4 m/s, and separates with 
the words “I will join you soon”. When is “soon”? 


Solution: 
Jogger B gains 0.5 m every second. Thus, it will take him 1,200 seconds (20 
minutes!) to join A again. <<< 


Calculating with powers of ten 


a 
: {\\ 


Fig. 2.4 orders of magnitude in the animal Kingdom I: The lengths of the animals 
are 4,000 mm, 400 mm, 40 mm, 4 mm. 


The careful treatment of powers of ten is important whenever one uses ma- 
thematics, such as when estimating results or when correctly interpreting 
computer calculations. 


Fig. 2.5 orders of magnitude in the animal Kingdom II: The masses of the animals 
are 10° mg, 10° mg, 10° mg, 1 mg. 
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If n = 1,2,... is any natural number, then the number 10” represents a 1 
followed by n zeroes. When applying the rules for general powers that hold 
for any base, we obtain 


10° = 1, 10! = 10, 
1 nm 
1° 10” = 19°" ee io. 
10” 
1 
io” = ——, 0)” = 10™, 
10” cae) 
"10" = 10m, 


The numbers n and m do not need to be natural and can each take on any 
real value. 


>>> Application: car tire wear per kilometer (Fig. 2.6) 
How strong is the wear of a car tire’s outer surface if a brand new tire loses 
1cm in depth after being driven for about 50,000 kilometers? 


SE OR | 


Fig. 2.6 tire wear... Fig. 2.7... with forward acting turbines 


Solution: 
If the outer surface wears down about 1cm = 107?m after 50,000km = 5- 
10*km, then its wear per kilometer is 


10m 10-107%m 
5-104 ss - 104 


= 2-10-'m = 0.2-10~° m = 0.2m (micrometer). 


® Remark: A car tire has a diameter of approximately 65cm (26 inches). Tread 
wear reduces the tire’s diameter by a centimeter, and therefore, the circumference 
by approximately 2-1.5%. The reduction of speed necessary to prevent being caught 
by speed cameras at 100 mu is at least 3 Mu Tires for aircrafts wear out much faster 
and have to be replaced more often than those for other types of vehicles. Fig. 2.7 
shows a landing on one of the shortest runways in the world (Madeira) and — due 
to the need to reverse thrust — the shimmering hot air emitted by the turbines. @ 


ddd 


>>> Application: hard disk space 
Suppose a hard disk has 100GB (gigabytes) of storage. How many typed 
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pages (75 characters per line, 40 lines per page), or alternatively how many 
color photos (at a resolution of 1,000 x 1,000 pixels) can be stored in an 
uncompressed form? 


® Remark: Note: 1 typewritten ASCII character takes up 1 byte of space. A color 
pixel occupies 3 bytes (red, green, and blue components at 1 byte each). 1 gigabyte 
is equivalent to 1,024 megabytes, 1 megabyte to 1,024 kilobytes, and 1 kilobyte 
to 1,024 bytes. Nevertheless, we can estimate 1GB as being equivalent to roughly 
10° bytes. Images are rarely stored in uncompressed bitmap format. Modern digital 
cameras allow taking photos at resolutions of 3,000 x 2,000 pixels or more. A 6 
megapixel image would require 18 megabytes of space on the medium of storage. 
Instead, images tend to be stored in compressed form (JPG format) and can thus 
be fitted into about 2 megabytes — depending on the degree of compression and the 
type of the image. ® 


Solution: 
1 page = 75-40 = 3,000 characters, 


100-10" ~ 30-10° = 30 millions of pages, 


1,000 x 1,000 occupy 3-106 bytes => 1910" « 30-103 = 30,000 images. <<< 


>>> Application: measuring distances through GPS 

Using GPS (Global Positioning System), it is possible to determine one’s 
position at any point on Earth up to a few meters. We will discuss in Ap- 
plication p. 363 how these remarkably precise positions are calculated. In 
summary, it is necessary to take accurate measurements of one’s distance to 
three satellites. These satellites continuously transmit characteristic signals 
that propagate at the speed of light. In order to determine the distance of 
the satellite, one must measure the time that it takes for the signal to re- 
ach the receiver, and multiply it by the speed of light. How accurately must 
such time measurements be taken so that the accuracy of position would fall 
within 1m? 

Solution: 

The speed of light 300,000 “8 = 3-10° 8 = 3-108 © implies that the time 
for the signal to cross a meter is 3 - 10-8 s x 3.3-10-*s. 

One should be able to measure time to an accuracy of a nanosecond. Even 
in a microsecond, the GPS signal will already have travelled a distance of 
300m. 


® Remark: Even atomic clocks (which are used on satellites) can “only” measure 
microseconds reliably (the magnitude of error of such a clock equates to about 1 
second in 6 million years). To overcome this difficulty, the GPS employs a trick: 
The characteristic signal encodes a description within the wave itself: Since it has a 
period of 300m in length, the exact position within the oscillation at the time the 
wave is received can be observed. Thus, it is possible to say how many nanoseconds 
have passed since the last microsecond. ® <<< 
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>>> Application: of billions and trillions 

Comparing the English language with other languages, the big numbers have 
names that are “false friends”. This leads quite often to misunderstandings 
and wrong translations. 10° is usually a “billion” in English, and a “milliard” in 
most other languages. 10! is called a “trillion” instead of a “billion” elsewhere 
(https: //en.wikipedia.org/wiki/Names_of_large_numbers). So, how far 
is a light year, how much is the gross product of Germany, when you find a 
German website with the content “Deutschland/Bruttoinlandsprodukt 3,73 
Billionen USD (2013)”, etc. 

Solution: 

In the case of the light year (9-10'*km): “six trillion miles”, 

In the case of Germany’s gross product: 3.73 billion USD (2013) or 3,73 
Milliarden USD, or, much better: 3.73-10° USD (2013). <<< 


Manipulations of equations 

Applied mathematics is constantly working with formulas, which consist of 
relations between given values (which may be variables or constants) that 
output new values (variables). Usually, there exists a given “explicit solution” 
for a preferred variable. If one wants to calculate another variable within such 
a formula, one must transform the whole equation. We will now briefly remind 
ourselves of the main rules which may be used during such transformations. 


Elementary operations 


Equivalence transformations are elementary (addition and subtraction as well 
as multiplication and division of equal terms or quantities on both sides of 
the equation, except division by zero). So, we find 


a=bsatc=b+cec or a-c=b-c, 
a b 
a=b<ac=bc or —=- (c#0). 
Cc 
The product of a sum quantity and a sum is the sum of two products (dis- 
tributive law): 
ab+ac=a(b+c). 


One often needs to employ 


(a+b)? =a? 42ab+b? and (a+b)(a—b) =a?-b?. 


Computing with powers (of the same base): 


Coane asa, and (ay Sa". 


When raising values to a given power, parentheses are important, since 
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(ay £ a"), (2.1) 


@ Remark: For example, (1010) '° = 10! is a number with 100 zeros, and 10(0"°) = 
1910000000000 i; » number with 10 billion zeros. Even 4(4") = 4256 ~ 10/54 has 154 
digits. The largest number that can be written using three digits is obviously 99"), 
This number cannot be represented on any computer (depending on the processor 
being used, the largest values used by computers fall within the range of 10°°°), but 
can only be processed with software for algebraic computation, such as Derive. ® 


Representing fractions 
“Equating fractions”: 


oe ad=be. 

b d 
It follows from 4 = : that a = b and vice versa. Resolving double fractions 
means: 


: @- 9c | ud 
c be i be 


“Common denominator”: 


a d ae+dc 
be be bce 


Proportions, intercept theorem 


Let us now address axioms, which are simple and apparently intuitive state- 
ments that are, nevertheless, unprovable, but that must be assumed in order 
to deduct logical conclusions from them. An example of this is the intercept 
theorem: 


a 


A By A BB, 


Fig. 2.8 similar triangles, intercept theorem 


Consider a triangle ABC, whose side lengths are represented by a, b, c mul- 
tiplied by a constant factor k (the similarity factor). Thus, it is possible to 
describe a similar triangle A; B,C) (Fig. 2.8) with side lengths 


a,=ka, bb =kb, cq =ke. 


These relations of similarity are not limited to the specific triangles ABC and 
A, B,C\. In fact, they can be applied to any triangle with identical distance 
proportions. 
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Similar triangles have equal angles. 
Furthermore, we have: 


AB: A,B, = AC: AiC, and AB: A,B, = BC: ByC\. (2.2) 


If we move the triangles inside each other so that the legs of an angle join 
the legs of a corresponding angle of the other triangle, then the remaining 
two sides are parallel. The two ray sets are then described by formulas (2.2). 
In other words: 


If you intersect two neighboring sides of a triangle with a pair of parallel lines, 
then the ratio of the segments on one side equals the ratio of the segments on 
the other side, and it also equals the ratio of the segments on the parallels. 


>>> Application: lens equation (Fig. 2.9, see also Application p. 54) 
Consider a spherical, symmetrical, and convex lens such as a magnifying 
glass. A “thin lens” (where the spherical radius is much larger than the lens 
thickness) is a good approximation for light rays near the optical axis adhe- 
ring to the following three laws (using the notation of Fig. 2.9a): 


1. Any ray directed towards the lens center Z, “principal ray” PP*, is not 
refracted. 


2. Rays parallel to the optical axis FF are refracted so that their refractions 
are concurrent in F’ on the other side of the lens. 


3. The outgoing rays of an object point P are refracted so that their refrac- 
tions concur in an image point P*. 


Let the distance of the object point P to the lens center Z be denoted by g 
and 1Z =: b (see Fig. 2.9). 


Fig. 2.9 lens equation 


Deduce the lens equation, in other words the relation between g and b. 
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Solution: 
The triangles P2P* and ZF'P* satisfy the conditions of the intercept theo- 
rem, which proves our claim, since 


P*P:P*Z =P2:ZF =: f. 
Analogously, for the triangles P3P* and Z1P: 


PUP SP = Porta (ob) sb. 
This yields 


g gtb : 1 gt+b_ g , b 
fo ob  f bg bg bg 
The simplified version is the easy-to-remember lens equation: 


ee + : (2.3) 
fo bg , 
In practice, we often know f and g. The distance b of the image can be cal- 
culated by Formula (2.3) where it is multiplied by the common denominator 
fgb. This causes the other elements of the equation to move to one side of 
the equation. After that, b “cancels out”: 


fg 
ne 


bg=fgtbf>bg-bf=fg>b(g-f)=fg>b= 


SY 


If we introduce f as a “scalar” and set g =k f, we have 


—— 


Fig. 2.10 convex lens (magnifying glass) with fuel and a magnifying effect 


In Fig. 2.9a, the image of the (real) object is reduced and upside-down. It is only 
magnified if we move closer to the object so that g < f. In that case, the image is 
upright but virtual (see Fig. 2.9b, Fig. 2.10): 


0<g<fs0ck<l> ro <0. 
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Fig. 2.11 concave lens (concave mirror) with fuel and a magnifying effect 


The value k = 1 (f =) is obviously a critical value (division by zero!): Thus, 
the image is “infinitely large” and is also “infinitely far away”. For a very large 
k (k > o), the factor — converges towards 1, and the value of b converges 
to f accordingly. 

Rays parallel to the optical axis are bundled in the opposite focal point. 

@ Remark: A similar property of “burning mirrors” was allegedly utilized by the 
Greek mathematician Archimedes (298 - 212 BC) to sink Roman ships (Fig. 2.11, 
left). However, the accuracy of this legend is doubted, because such an undertaking 
in practice would be quite difficult. After all, cases are known where dried grass igni- 
ted due to the internal effect of dewdrops. Anyway, one can still count Archimedes 
among the greatest mathematicians. He lived in Syracuse and in Egypt, where he 
discovered the respective laws for the lever and the lift. He also gave the first good 
approximation of the mathematical constant 7, and even of integral calculus (!). On 
a side note, he was also the inventor of the Archimedean screw. ® <<< 


Real halfspace 


Virtual halfspace 


principal plane 
corresponding to 
sensor plane 


< 
= 
s 
o 
oO 
= 
re) 
cs) 


plane 


Fig. 2.12 To each object in space, the lens system creates a collinear virtual object. 
The sensor plane corresponds to a plane perpendicular to the optical axis. Perfectly 
sharp points in a photo lie in that plane. 


>>> Application: spatial images produced by lens systems (Fig. 2.12) 
The illustrated ideal lens system generates for each point P a virtual point 


2.1 The fundamentals of numbers and equations 19 


P*. The pixel on the chip is the — theoretical — section of the ray PZ with 
the sensor plane. 

In reality, there is no “pixel”: It is rather a circular “image spot” on the sensor 
(chip), which is formed by all the light rays emanating from P, going through 
the circular aperture, and forming an oblique circular cone.! This circle is 
called the blurry circle in photography. <<< 


>>> Application: parallel resistors (Fig. 2.13) 

The lens equation also occurs in an interesting manner with resistors connec- 
ted in parallel. Denote by R; and Rez the resistance of two resistors connected 
in parallel (so-called partial resistors), then the total resistance R of the sys- 
tem is given by: 1/R=1/R, + 1/Ro. 

The total resistance is always less than each partial resistance. Assuming 
that the total resistance R = 50Q and one partial resistance Ry = 75Q are 
given, then we can find the other partial resistance Ro: 


_ RR, _ 500-750 


R = = 150 Q. ddd 
2" RR %Q-500 

I, R, 
P > 

I R, 
I} U 

| 
Fig. 2.13 parallel resistors Fig. 2.14 performing task 


>>> Application: speeding up a process (Fig. 2.14) 

A machine completes a task in 8 hours. How much time will it take a second 
machine to complete the same task, so that both machines can do the job in 
4.8 hours? 


Solution: 
Let W denote the total work. From physics we know that Power = Work 
/ Time. 


The performance of the first machine is P, = x and the performance of the 
Ww 


second machine is P) = v The performance of both machines equals P = 73. 


Since P = P, + P, the equation 
WwW W W 1 1 1 
= > —+-—= 
8 « 48 8 «a 48 
is a “lens equation” from which x = 12 can be calculated as in Application 
p. 16. << 


For further information on this topic, see G. Glaeser: Geometry and its Applications. Springer 
Verlag, New York, 2012. 
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Equations involving a single root 


>>> Application: compound interest (Fig. 2.15) 
The formula for the return of an output capital Ko over n years, with annual 
capital (interest rate p), reads 


K = Ko (1 + a 
100 
Let the initial deposit be Ko = 10,000.00€. After five years (with fixed 
interest rates) the resulting capital A = 11,000.00 € (already reduced by the 
capital gains tax) will be paid out. How much is the net interest rate (without 
taxes)? 


Solution: 
Firstly, we expect that, in general, 


Kk _ , p \r p_n/kK 
Ko (Le hy) => 1+ 365 Ko 


= p= 100 (*/# -1) 


and calculate p 
11 
p = 100 (i) ee i = 1.92. 
Fig. 2.15 compound interest 10, 000 


® Remark: The recommended strategy is to reshape a formula so that it becomes 


generalized in such a way that one can only use numerical values. Firstly, calcula- 
tions may become more accurate, and secondly, this reshaped formula can be used 
with arbitrary numerical values. ® <<< 


>>> Application: mathematical pendulum 

To determine the gravitational acceleration g at a particular place on Earth, 
one must measure the period of oscillation T of a simple pendulum of length 
L. What is the value obtained for g (in m/s?) if we apply the following 


formula: 
L 
es am) (2.4) 
g 


T= any /E es An? 2 > gT? = 4n?L => g 4n? 4. 
Numerical example: 
L=0.5m, T = 1.420s > g = 9.789m/s? (Beware of the dimensions!) 


Solution: 


® Remark: Later, we will derive Formula (2.4) with the aid of integral calculus 
(Application p. 420). Note, incidentally, that the period of oscillation can be mea- 
sured very accurately since it depends on the angle of deflection and, provided the 
pendulum is built so that the angle of deflection is sufficiently small, the period is 
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independent of the oscillation amplitude. The pendulum is not, per se, a “perpetu- 
um mobile”; so, energy is to be added constantly — in this case by means of a towing 
weight. ® <<< 


>>> Application: maximum speed in freefall 

Following Newton’s Formula (2.5), the air resistance depends on various fac- 
tors, including the square of the speed v. Determine from this the maximum 
speed in free fall. <<< 


Solution: 
According to Newton, we have 


2 
Fw = cw Aas. (2.5) 


Here, g is the density of the flowing fluid (gas or liquid) and v is the velocity of 
the body relative to the fluid. A is the cross-sectional area of the body and cw 
is the drag coefficient specific to the body-type indicating how streamlined 
a barrier is. It is noteworthy that the force Fw depends only on the mass m 
(to a certain degree and via the cross-sectional area). 

For the freefall we have: 

ma=mg- Fy. 
The current acceleration equals a (0 < a < g). The weight ma (usually mq) 


decreases with increasing resistance. If a reaches the value zero, then the 
maximum speed is reached. Then, we have 


2 
Umax 2mg 


=> U = . 
max cw A 0 


mg=cw Ae 


Numerical example: A human body with A = 0.8m?, a drag coefficient cw = 0.8, and a mass of 
70 kg achieves at an air density of 1.3 kg/m? a terminal velocity of 


2-70kg- 10% 
Uae = ——_—_— x 41m/s 
0.8- 0.8m? - 1.3=5 
m 


(approximately 150km/h). An ant that is 1 cm long with a mass of 40 mg, with the same cy, and 
A =0.4cm? only achieves 


2-40-10-Skg- 1035 
0.8-0.4- 10-4m? - 1.38 
m 


x 4.5m/s 


Umax = 


(see Application p. 111). 

>>> Application: plunge into water (Fig. 2.17) 

What is the average deceleration experienced by a diver when the diver jumps 

from an altitude of Hm into the water, and he reaches a depth of Tm? The 

following formulas come into play: 

Immersion speed is vp = \/2g H (deduction of the formula in Application 
2 


p. 59, Formula (2.29)), uniform deceleration a = sh (derivation of the formula 
in Application p. 59, Formula (2.27)). 
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Fig. 2.16 different styles of diving ... 


Solution: ‘ 


We substitute vg = /2g H into a= om and obtain 


_ (29H)? _29H _gH _H 
Oe OE men a 
The average deceleration is directly proportional to the height H from which 
the diver jumps and “indirectly” proportional to the depth T' of immersion. 


Numerical example: With a dive from a five meter diving board (H = 5m) and 
an immersion depth of T = 4m, a negative acceleration of about a quarter of the 
gravitational acceleration is reached. 


Fig. 2.17 going loco down in Acapulco 


Much more extreme, of course, is the practice of the famous “death jumpers” of 
Acapulco/Mexico. They jump from a height of 40m and have to roll quickly when 
submersed in water, because the water at the immersion point is only 3.6m deep. 
The deceleration is then 11g on average. 


® Remark: The formulas are also valid when applied to the frontal impact of a 
vehicle crashing against a wall with an airbag of great quality. The deformation of 
the hood of a car and the relatively small breaking distance by the airbag together 
produce about T x 0.8m. Upon impact at 72km/h = 20m/s, there is an average 
deceleration of about 25g. This is almost at the limit (30g) of what the human 
organism can survive in the short term. ® << 
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2.2 Linear equations 


General information about algebraic equations 


Mathematicians distinguish between algebraic and transcendental equations. 
For algebraic equations, clear statements about their solutions (number, mul- 
tiplicity) can be made. One often has to compute the zeros of a function 


fe) =e Gat” - ie eae + ag: (2.6) 


Mathematicians often use the “sum notation” 
a k 
f(a) = Di age”. 

k=0 


The natural number n is called the degree of the equation. The algebraic 
equations of degree one are called linear equations; those of degree two are 
called quadratic equations. 

A value « is called a solution or a root of an equation if f(a) = 0 is satisfied. 


The simplest special case of an algebraic equation 
For n = 1, Formula (2.6) has the form 


f(2) = ax) + a9 
which is usually written as 
y=katd. 


If & + 0, the equation has exactly one solution 
d 
gO ed De 


>>> Application: linear profit function (“naive approach”) 

A product is to be produced and sold to a wholesaler. The necessary initial 
investment amount is Kg MU (monetary units). The cost of generating each 
piece is E MU. Up to what number of pieces is the business “in the red”, if 
one can arrive at a selling price of V MU per piece? 

Solution: 

The profit function for x units sold is linear in this simplified case 


y=Va-(Ko+Ex2)=(V-E)2- Ko. 


The zero is obviously the critical number of pieces. Later on, one starts to 
make a profit. This is the case for 


Ko 
V-E 


(V-E)x-Kj=0>27= 
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>>> Application: conversion between temperature scales 

The unit for measuring temperature in physics is Kelvin, in Europe it is 
Celsius, and in the US it is Fahrenheit. The conversion from Celsius to Kelvin 
is easy: K = C+ 273.15°. A conversion that is particularly simple is the one 
from Celsius to the Réaumur scale (Fig. 2.18): R= 2C. In order to convert 
from Celsius to Fahrenheit, observe the following: 0° C corresponds to 32° F’ 


and 100° C' corresponds to 212° F. Determine conversion formulas between 


the two scales. 


Fig. 2.18 Rarely seen: On Réaumur’s scale, water also freezes at 0°, but it starts 
boiling at 80°. 


¢ 
¢ 
| 


= 20 


HINT 


ait 


| 


Solution: 
We first describe the temperature in Celsius as c and that in Fahrenheit as 
f. Then, we can apply the linear ansatz 


f=kc+d. 
We insert the two nodes, and thus, we obtain two linear equations 
32=O0k+d and 212=100k+d 
which immediately yield d= 32, and thus, k = 2 = 1.8. So, we arrive at 
f= 2¢ +32 or equivalently c= 2 (f - 32). 


>>> Application: expansion of a body when heated 
If the temperature of a body with volume V, is raised by At, its volume is 
increased to V2 according to the following formula 


Vo =Vi(1+7At). 


Determine the material constant + if a given body’s volume increases by 3% 
at At = 50°. 


Solution: 
Reshaping the given formula yields 
1 (Va ) 1 0.03 _4 
= 1}= 1.03-1) = = 0.0006 = 6-10. 
1 Ret G 50 )= 59 
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® Remark: The material constant y and At conform to the same principle regardless 
of whether we measure in Celsius or in Kelvin — as is common in physics (see 
Application p. 24). ® <<< 


Imperial vs. Metric system 


>>> Application: conversion bar <= psi 

On many pressure gauges (e.g. on diving tanks), one can find values either in 
bar or in psi (pounds per square inch). A diver knows: A full tank is either 
x 200 bar or x 3,000 psi. Calculate the exact conversion numbers of this 
linear proportional relation. 


Solution: 

By definition, we have 1 bar = 10°kg/(ms”) = 10°N/m?. 

With 1 N * 0.22481 pounds and 1 meter * 39.37 inches (1”=2.54 cm), we 
then have 1 bar = 22, 481/39.377 psi = 14.5 psi or 1 psi = 0.069 bar. 

® Remark: Note that this was not trivial, since we must not confuse masses and 
weights (forces). A pound is, physically speaking, a weight. 

One bar equals approximately the ambient pressure at sea level. Every 10 meters 
under water, the pressure increases by one bar. ® <<< 


>>> Application: conversion miles per gallon = liters per 100 km 

There are some things in daily life that are easier to understand with the 
use of some simple calculations. US-Americans are more familiar with “miles 
per gallon”, and Europeans only think in “liters per 100 km”. What is the 
formula for this indirectly proportional relation? 


Solution: 

1 fluid gallon equals » 3.785 liters, 1 US mile * 1.61 kilometers (therefore, 
e.g., * 62 miles equal 100 km). 

Suppose, we are driving such that this would be equivalent to using «x liters 
per 100 kilometers and result in a distance of y miles per gallon. 

Let us assume that we continue driving at this rate until we have used up 
a gallon. This is the case after 100-3.785/z km or 100-3.785/1.61/2 miles 
(* 235/x miles). Thus, we have 


y » 235/x. 


@ Remark: For example, five liters per 100 km would be equivalent to 235/5=47 
miles per gallon. Note that you do not need a computer with Internet access to 
obtain this result; it may be quickly estimated by means of a mental calculation. 
By the way, this simple formula was not easy to find anywhere. @ daa 


The following two questions were part of a written exam taken by the author 
in order to acquire an American driver’s licence. Due to unfamiliar measuring 
units, together with questions about “turn on red” and special regulations 
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concerning school buses (unusual for Europeans), even seemingly simple tests 
can become critical ... 


>>> Application: How long is the braking distance at 55 mph? 
The proposed answers were a) 143 ft, b) 243 ft, c) 443 ft. What to choose? 


Solution: 

Firstly, how many meters per second is 55 mph? We have v = 55mph = 
55-1.61km/h = 88.5km/h » 25m/s. 

Secondly, a deceleration of a * 0.6g * 6m/s” might be a good value. 

Then we get the time for the braking process: v = at => t = 25/6s * 4s. 

This, finally, leads to the braking distance s = a/2-t? » 3-4? ~ 50m * 165 feet. 
Therefore, the author chose answer a) ... 


® Remark: The total stopping distance is the sum of the perception distance, the 
reaction distance, and the actual braking distance (once the brakes are put on). ® 
ddd 


>>> Application: How many ounces of 86-proof liquor have the same 
amount of alcohol as a six-pack of beer? 

This question is less hard to answer once you know what 86-proof liquor 

means .... 


Solution: 

A six-pack has 6-12 = 72 fluid ounces. The alcohol content by volume of 
regular beer may be 5 percent, that of 86-proof liquor is 43 percent, that is, 
almost 9 times as much. Thus, the estimated answer would be 72/9 = 8 fluid 
ounces. <<< 
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2.3 Systems of linear equations 


Linear equations in two variables 


The solution of a linear equation is so “trivial” that it needs no further expla- 
nation. It is scarcely any more difficult to solve a system of linear equations. 
Let us consider some introductory examples: 

>>> Application: mixing task 

How many liters of hot water (95° C) does one need to raise the temperature 
of n liters of cold tap water (15°C) to that of bathwater (35° C)? What is 
the ratio of cold and hot water? 


Solution: 

Let x and y be the amount of cold water and 
hot water respectively. 

1. Mass comparison: rt+y=n 
2. Energy comparison: 1574+95y=35-n 
Multiply the first equation by 15 and sub- 
tract it from the second equation. One will 
thus need 7 liters of hot water. The ratio of 


cold : hot is, therefore, 


pepe Ses Sas: 


| 


® Remark: One could also have solved the above example as follows: The tempera- 
ture difference between bathwater and cold water is —20°, and 60° for hot water. 
In this case, we use the energy comparison equation —20 2 + 60 y = 0, from which it 
also follows that x: y=3:1. 0 <<< 


>>> Application: total resistance and partial resistance 
Two resistors Ry and R2 connected in (Fig. 2.13) behave like 1: and have 
a total resistance of R. What are the individual resistances? 


Solution: 
For resistors connected in parallel, it holds that (see Application p. 19) 


1 1 1 


— +=, 
Ri Ro R 
By assumption, we have 
1 


1 
R,:Rg=1: — en 1 
1 2 aaa Ro nm 


1 1 
We now set «= — and y = —. Then, we have two linear equations 


Ry Re 


Sty=s, Bty=N. 


R 
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Following the second equation x = ny, and thus, from the first equation, 


ee espectivel ie € 
res ively x = : 
e - n+1R 


1 
Dina pee reesei 


Since we had two unknowns in the above examples, we needed two equations. 
By skilfully adding, subtracting or using “substitution”, we were able to solve 
such a “(2,2)-System” quickly. 

We now derive formulas for the solution (2/y) of a general system of linear 
equations in two variables, so we do not always have to worry about using 
such “tricks”. 

A (2,2)-system has the general form 


(I) auxtapy=hi, (2.7) 
(I) agi t+ Aa22Y = bo. 
; ait bi. 
Both of the equations are linear, for we have y = -—~x + — (i= 1,2). We 
a2 §Ai2 


want to find a pair of values (2/y) that satisfies both equations. In order to 
“eliminate” one of the unknowns, we multiply (I) by a2; and (II) by aj, and 
get: 


(I) agi a1, @+ agi dig y = agi by, (2.8) 


(II) aii @a1 @ + a11 G22 y = a1 bo. 


Now, we subtract the upper from the lower equation, and thus, we get 
(I) = (1) O-xr+ (ay, aQ2 — a21012) Y=aQ11 bo — a21 by (2.9) 


or 
ai1bg-a21b; _ Dy 


a11.422-a21412 D~ 
Now, x can be calculated from (J) or (JJ). The result is 


a226;-a12b2 — De 


@11422—a21 412 D 


The result in this form is ideally is verified by computers; it is easily pro- 
grammable. It must be mentioned that there exist more efficient methods 
for solving linear equations. This applies, in particular, to systems of linear 
equations with three or more variables, but they require — at least program- 
matically — much greater effort, because different cases have to be considered. 
We gave formulas for x and y that always work as long as the denominator 
D is not 0. If the denominator “disappears”, then the system has no solution. 
The only exception being: The equations are (scalar) multiples of each other. 
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Then, of course all (infinitely many) pairs (a/y) satisfying equation (J) are 
solutions of the system. 
It is difficult to learn these formulas by heart, because they involve confusing 
indices of the coefficients. Here we are aided by Cramer’s Rule (Gabriel 
Cramer, 1704-1752, a Swedish mathematician and philosopher) — a service 
that will subsequently be important. It is based on the common “matrix 
notation”, in which system (2.7) — omitting variables x and y for short — is 
written as follows: 

by 

] 


The right column can be described as a “spare column”. From this matrix, 
one builds the three “determinants” (square number schemes) as follows: 


aii 12 
a21 422 


bi a2 
bg age 


_ fai by 
74 Jao1 be 


aii a12 
a21 422 


D= , (2.10) 


? 


The sub-determinants D, and D, arising from the “major determinant” D are 
formed by respectively replacing the x or y column by the spare column. The 
“value” of such a determinant can now be defined as the difference between 
the product of elements in the principal diagonal (from top left to bottom 
right) and the product of the members in the anti-diagonal, for example, 


Q@11 12 


= A11 492 — A921 A212. (2.11) 
a21 422 


>>> Application: intersecting two lines 

Where do the lines 27+ 4y=5 and 3x-5y=0 meet? 

Solution: 

The intersection of two lines in the plane leads to a (2,2)-system. The de- 
terminants that are sought in this case are 


2 4 5 «4 2 5 
p-|; ‘--2 (+ 0), De=|j S725 Dy= [5 a|=-15 
‘ ‘ : 25 7/15 
According to Cramer’s Rule, we have the intersection s(5/5). See also 


Application p. 57. 


Fig. 2.19 three cases when intersecting two lines 
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® Remark: If the principal determinant vanishes, the lines are either parallel — so 
that the “point at infinity” S. is the solution — or they are identical — so that every 
point is a solution (Fig. 2.19, the situations depicted in the center and right). 

If one uses the “substitution method”, i.e. expressing either variable by the other and 
substituting them into one equation, one recognizes the exceptional case of D = 0 as 
follows: When the lines are identical, one obtains a “true statement” without further 
information about x and y; if they are different, then it is a “false statement”. 

When calculating with computers, it may be quite advantageous to work with 
“points at infinity” so that, instead of using D = 0, we use a very small value 
that is about D = 10°1°. This saves us from addressing a large number of different 
cases since the result will be “true” for values of precise decimals. @ daa 


Linear equations in three or more variables 


The advantage of Cramer’s Rule becomes even more apparent when we solve 
three linear equations containing three variables, that is, a (3,3)-system. We 
can then follow the exact same rules that apply for a (2,2)-system (though 
this can be a bit tedious to prove by recalculation). We will just need to 
know how to calculate a “three-row” determinant: Perform this calculation by 
calculating three of the “two-row” sub-determinants; this is called “expanding 
by complementary minors”. This technique will be explained by means of 
several examples (Application p. 30, Application p. 32). Expanding by the 
minors complementary to the first column gives: 


Qi1 G12 13 
a21 422 423) = @11 
431 432 433 


a22 423 Qa12. 413 


a42 
e + A31 
a22 423 


; 2.12 
432 433 ( ) 


-a 
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We can obviously associate a;, to an element in the sub-determinant created 
by deleting the i-th row and k-th column. It should be noted that each 
summand is given a sign according to the following scheme 


+ -— + 
es (2.13) 


>>> Application: intersection of three planes (Fig. 2.20) 
We would like to find the common point S of the three planes 


2e+y=3, yt2z=0, y-z=l1 


(see also the chapter on vector calculus). 

Solution: 

We write the three equations in matrix form. It must not be forgotten that 
zeros have to be placed if a variable does not show up: 


21 0/3 
0 1 2 40 
0 1 -1}1 
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The principal determinant is now 


21 0 
Dao 1 2 - af 2 oh “feo | =-6. 
01 -1 


With the same sub-determinants, D, assumes the following value 


Sw 


Fig. 2.20 the intersection of 


three planes ... Fig. 2.21... including special cases. 
3 1 O 
1 2 1 O 1 0 
D,=|0 1 2 3 |-o a J=-7 
a 1 -=1 1 -l 1 2 
Finally, we have 
2 3 =O 2 1 3 
2 1 
D,y=|0 0 2 =a} * 4, D,=)|0 1 0-2 |=2 
0 1 -l 0 1 1 
; x Dy z 
According to Cramer’s Rule, now x = De pes , and the intersec- 


tion point has the coordinates 
7 12 1 
Shalala) 


® Remark: If the principal determinant vanishes, the lines of intersection of any 
pair of planes are either parallel or identical (Fig. 2.21). If the planes are identical, 
then each point of the plane(s) is a solution. The planes can also form a “pencil” 
when they have a common line. All of the points on this line are then a solution. 
This common line can also be the common “line at infinity” of three parallel planes. 
The planes can intersect each other even along three parallel lines (one of which 
may also be a line at infinity). Then, the common point at infinity of this line is a 
solution. In computer calculations — particularly in animations or simulations where 
such special cases can sometimes occur, but not always — one can continue in the 
cases with a zero determinant by setting the determinant to a very small value (e.g., 
D =107!°) in order to circumvent nasty case distinctions (see Application p. 29). ® 

ddd 
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>>> Application: A-connection, Y-connection (Fig. 2.22) 
When transforming a A-connection into a Y-connection, the following equa- 
tions occur (see Application p. 27): 

Rj (Rx + Ri) 
rRtri= = 

Ry + Ro + Rez 

The indices 2, 7, k run “cyclically”, starting with 7 = 1, 7 = 2, and k= 3. The 
values r; are to be expressed in terms of the R;. 


ae 


Fig. 2.22 A-connection, Y-connection 


Solution: 

Once again, we write down the three equations explicitly: 
T2- Pi = s Ro(R3 +R) =Ai, 
m+7rq = sR3(Ri + Re) = Ao, 
Por TrZ:) = 8. Ri (Re se R3) = A3 


(with s = 1/(R, + Ro + R3)). In matrix form, the scheme is as follows: 


10 1)A 
1 1 0| Ag 
@ 1.7 \ ds 


The principal determinant is now 


1 01 
efit dealt af sop Yoo, 

0 1 1 
Furthermore, we have 
A; 0 1 

D, = A» 1 Gl =cAle i|- 42 eae | |= At Aa Ao. 
1 1 1 1 1 0 

A3 1 1 


We expand the remaining determinants for calculating rg and r3 by expan- 
ding along the middle and right columns respectively: 


1 A, 1 


Dg=)\1 Ag O]=-Aj, 
0 Az 1 


1 0 


1 1 
0 1| * 42 


0 | - 4s 


1 1 
1 g|=A2+ A= Ah, 
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1 1 
0.1 


1 0 
0 1 


1 0 


+ As], 1 


0 A 
1 Ap} = Ay 
1 A 


3 


Ps 


[= Aa + Ar = Ae 


The new resistors are then 
a D; 7 Az+ Ay =A, 
dD > 


Yr; 


The indices 7, 7, k again run in a cyclical order, starting with 7 = 1, 7 = 2, and 
k =3. 

® Remark: One remembers formulas better when they display regularities. Often 
such regularities in output are an indication that one has calculated correctly. Ein- 
stein once said that a formula is only correct if it is “beautiful”. © daa 


>>» Application: parabola on three points (Fig. 2.23) 

There are very common techniques for the approximation of complicated cur- 
ves by simpler ones such as parabolas. A famous example of such a technique 
is the Kepler’s Fass-Rule (Fass = German word for barrel, see Section 8, 
page 413). For this rule, one derives the coefficients a, b, c of the equation of 
a parabola y = ax? + br +c passing through three points P(u;/v;). 


Fig. 2.23 quadratic parabola (3 points) Fig. 2.24 parabola of degree 6 (7 points) 
vs. cubic parabola (4 points) vs. cubic spline 


Solution: 
The coordinates of the three points have to “fulfil” the equation of the para- 
bola, so we immediately get three equations with three unknowns a, b, and 


st 
ur a+ujb+ce = VI, 
uzatugb+e = va, 
ugatugbtc = v3. 


We solve this by means of Cramer’s Rule. The principal determinant is: 


ue uy Ll 
D= us u2 1 


u; uz 1 
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® Remark: Generally, one can interpolate n+ 1 points by a “parabola of degree n” 
(Fig. 2.23 and Fig. 2.24). However, complicated curves in computer graphics are 
rarely approximated by parabolas of degrees higher than 3 — such curves tend to 
oscillate (especially at their end points, Fig. 2.24). Instead of choosing a higher 
degree curve, it is better to use a sequence of “cubic parabolas” which are joined 
as smoothly as possible. The overall approximating curve is a cubic spline. Such 
splines also require the solution of systems of linear equations. For more details, 
read the chapter on differential calculus (Application p. 290). ® <<< 


Systems of linear equations having higher degrees are not uncommon in app- 
lied mathematics. Often one can “linearize” complex problems by introducing 
additional variables. This allows us to obtain a simple system of linear equa- 
tions with many unknowns, rather than a complicated system with a few 
unknowns. 

Such (n,7)-systems can also be solved using an adjusted version of Cramer’s 
Rule. The determinants of degree n develop gradually by a summation of 
determinants of degree n - 1 until they reach a degree of three. The higher 
the level, the more complex Cramer’s Rule becomes, and one often uses other 
methods with computers leading to a more efficient way to obtain solutions. 
Here, we have a simple example of using a (4,4)-system, this can be solved 
by skilfully inserting determinants (the “substitution method”): 

>>> Application: combustion of alcohol in carbon dioxide and water 
Consider the “chemical reaction formula” when one has sufficient oxygen 
to burn alcohols — such as the ethanol found in sugarcane liquor, which 
decomposes C2H;OH — to COz2 and H20O. 

Solution: 

The following holds true 


nNy1° C2H;OH + Tip O2 =n3°: CO2 +7N4° ALO. 


The four unknowns n,; can be calculated incrementally if we compare the 
quantity of the atoms: 


Carbon (C) 2n1 =N3, 
Hydrogen (H) (5+1)n1=2na, 
Oxygen (O) ny +2ng=2n34+N4. 


The first striking observation: We have three equations but four unknowns. 
However, there is an additional condition: The n; must be integers. First, we 
substitute m1 = 1 and hope to obtain integer solutions for the remaining n;S 
(If we do not obtain such solutions: Chemists also reckon with fractions!). 
Now we only need to solve a (3,3)-system: 


2=n3, 6=2n4, 14+2n2=2n34+N4. 


2.3 Systems of linear equations 35 


The solutions can immediately be seen: The first equation evaluates to n3 = 2, 
the second one evaluates to n4 = 3, and thus, the third evaluates to ng = 3. In 
fact, all solutions are integers (otherwise you will have fractions or will have 
to try n; = 2,...). The chemical reaction formula is thus 


CoH;OH +3-O9 =2-CO2+3- H20. 


>>> Application: explosive combustion of gasoline vapor 

Gasoline, especially, contains alkanes that have 6 to 9 carbon atoms in their 
molecules such as CgHjg. This is used as a fuel due to the fact that the 
mixture of gasoline vapor and air combusts explosively into water and carbon 
dioxide given a certain composition. Calculate the coefficients of the chemical 
reaction equation 


ny: CgHig+ng-Oo=7n3-COo+n4- HO. 


Solution: 
We calculate the four unknowns n,; gradually by comparing the quantities of 
the atoms: 


Carbon (C) 8n1 =Nn3 
Hydrogen (H) 18n, =2n4 
Oxygen (O) 2n2=2n3+n4 


We have three equations but four unknowns. 
The n; must be integers. First let us put 
again n; = 1 and hope to obtain integer so- 
lutions for the remaining n;. 

The (3,3)-system is now 


8=n3, 18=2n4, 2ng=2n34+N4. 


The first equation gives n3 = 8, the second 

gives n4 = 9, and thus, the third gives nz = (16 + 9)/2 = 12.5, which is not an 

integer. Thus, we try n = 2. Therefore, the appropriate (3,3)-system is 
16=n3, 386=2n4, 2n2g=2n3+4+N4, 


which gives the chemical reaction formula 


2-Cg Aig + 25-O2 = 16-CO2+18- H20. da 
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2.4 Quadratic equations 


The pure quadratic equation 


The equation 
gz? = D 


has two real solutions if D > 0: 


t= VD, 2 = -J/D. 


The solutions for D = 0 are: x1 = x2 = 0. The solutions for D < 0 are complex 
conjugates. More on this in Section B. 


Fig. 2.25 graphical solution of x? = D Fig. 2.26 convenient paper format 


>>» Application: convenient paper format (Fig. 2.26) 

Let the side lengths of a rectangular sheet of paper be a and b with a ratio 
a:bsuch that two similar rectangles arise when folding the sheet. How should 
one read the aspect ratio? How long are the sides of the paper if the surface 
of the sheet is 1m?? 


Solution: 


2 
> P= = a=bv2 


boa 
b 65) OD 
(the solution a = —b\/2 is not relevant). 


If the area of the rectangle (1m?) is the only information given, b can be 
calculated, and thus, a as well: 


ab=b/2b=b°V/2=1 


= b= / = =0.84m = o=1.180m. 
V2 


This format is called AO. Repeatedly folding the sheet gives the formats 
Al = 0.841 x 0.595, A2 = 0.595 x 0.420, A3 = 0.420 x 0.297, A4 = 0.297 x 0.210 


; 1.32 1.29 12.9 8 
etc. (with areas 5m“, gm“, ,m*, jgm’, ..-). 


ddd 
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The general quadratic equation 
In the general form of the quadratic equation, 


Ar? + Br+C=0, 
the sign of the so-called discriminant 
D=B?-4AC 


determines the number of real solutions: For D > 0, it gives two real solutions; 
for D = 0, it gives one real solution; and for D < 0, it gives two complex 
conjugate solutions. 

The solutions can be given directly 


-B+J/D 
2A 
Proof : We first divide equation Ax? + Bx + C = 0 by A, and we get 
, B C 


x eee al 


£12 = (2.14) 


Now, we complete to a full square 


Goriergcr) 
x + + =0. 
2A A \2A 
Hence, we have a purely quadratic equation: 
(> 2) -(2) C_ B?-4AC 
"2A 2A) A 4A2 
The solutions are given by Formula (2.14). © 


y=AX+Bx+C y 


x=-B/(2A) 


Fig. 2.27 graphical solution Fig. 2.28 harmonic rectangle 


>>> Application: Golden Ratio, harmonic rectangles (Fig. 2.28) 
The side lengths a, b of a rectangle are chosen such that 6 < a where 


bia=a:(a+b) S07 =ab+'. (2:15) 


We then call this rectangle harmonic. Calculate the ratio of the side lengths 
(the Golden Number). 

In addition, prove the following: One rectangle divides into a square and a 
smaller rectangle, and this smaller rectangle is again considered to be har- 
monic (this ratio is called the Golden Section). 
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Solution: 
We first set b= 1 and then, 


1 a 2 

= => a -a-1=0 > 

a atl 

(-1)+/(-1)?-4-1-(-1) | 1: 
2 2 

We do not consider the negative solution (however, see Application p. 528). 


Thus 
_ i+ V5 
2 
Assuming that Formula (2.15) applies, we also get 


a? =ab+b? > a? -ab=b? > a(a—b) =? > (a—-b):b=b:a4. 


Q1,2 = 


a:b :1 1.62: 1. (2.16) 


Fig. 2.29 pentagram Fig. 2.30 Modulor, Parthenon 


® Remark: Harmonic rectangles or the golden ratio often occur in art.? Examples 
include the “Modulor” by Corbusier (Fig. 2.30, left),? the “magic pentagon” (“pen- 
tagram”, Fig. 2.29: AC : AB = 11/5 : 1), and the Parthenon (Greek temple, Fig. 
14/5 


2 


this can be found in Appendix B. @ <<< 


2.30). In nature, the remarkable number plays an important role. More on 


>>> Application: upward vertical throw (Fig. 2.31) 

Let vo be the initial velocity and let g be the gravitational acceleration. The 
air resistance in this example is negligible. 

Then, for an elapsed time of t seconds, the height h is given by 


h = G(t) - B(t) = vot - 22. (2.17) 


2 
G(t) is the component of the upwards uniform motion; B(t) is the component 
of the downwards uniformly accelerated motion. 

Compute t at given values of h and v9. 


?See http: //www.uni-hildesheim.de/~stegmann/goldschn.pdf for an interesting article on the 
historical development of the Golden Section. 

3Le Corbusier (Charles Edouard Jeanneret): The Modulor: A Harmonious Measure to the Human 
Scale Universally Applicable to Architecture and Mechanics and Modulor 2 (Let the User Speak 
Next). 2 Volumes. Birkhauser, Basel, 2000. 
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Fig. 2.31 upward vertical throw 


Solution: 
We collect together powers of t as follows: gt? —vgt+h=0 and find 


A=5, B=-w, C=h= D=(-w)? 4oh= 0% 2gh. 


This results in 
t _ v9 t v/vG — 2gh 
1,2 F . 
The values t; and tg are the times at which the thrown object reaches height 
h, the difference tz - t, being the time it takes to rise and fall back down to 
this height. For the highest point (the current height H), there must be a 


double solution: 


2 
D=0> 1)-2gH =0>H= 32. (2.18) 
g 
Conversely, measuring from the current height H, the initial velocity vg can 
be determined as: 


v9 = V2gH (2.19) 


® Remark: This formula unexpectedly provides a good service in Application p. 56. ® 


® Remark: If we obliquely throw upwards (with an angle tilted at a to the horizontal), 
the value of vp sina is used instead of vp (Application p. 142). ® <<< 


>>> Application: balance of the attraction forces (Fig. 2.32) 

Let m, and mz be two masses with centers of gravity separated by a distance 
d. The center of gravity of a body P is located at a distance x from the center 
of gravity of m,, on the line joining the centers of gravity of m, and mg. For 
which x will the attractions between m, and mz compensate? 

Solution: 

According to Newton, the attraction is proportional to the mass, but it is 
inversely proportional to the square of the distance. For the body to be in 
equilibrium, one must have: 
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Fig. 2.32 Where does P have to lie in order to be attracted equally by m; and m3? 


My m2 
— = ——— (0<a<d 
2 Gage 
=> m(d? -2dx +27) = mgr? > (m1 - m2) 2?-2midx+ md =0. 
os —o~-—”’ hana 
A B 


Now we turn to Formula (2.14): 


2m4d + J4mid? — A(m1 — m2)mid? _ m+ /mimy d 


1,2 = 
2(m1 -— m2) my —™M2 
Only one of the two solutions (namely the one associated with the “—”) is 
relevant (since the forces must have opposite signs). This implies x < d and 
my —./mim 
to (2.20) 
my — ™m9 


® Remark: It often proves to be convenient to consider only the ratio of the two 
masses. This could provide a better understanding of the relations. We can rewrite 
Formula (2.20) to show this by dividing the right term numerator and denominator 
by m, 


= ¢=— "4. (2.21) 


Now we employ a small trick: we have 
a-b a-b ol 
a2-b2 (a+b)(a-b) a+b 


me 


In our case, let a= 1 and b= : 
My 


Thus, we can simplify Formula (2.21) to get 


1 
x = ——— d. 


m2 
1+ aa 


Numerical example: For a spacecraft between the Earth and the Moon (which have 
a mass ratio of 1: 81, d* 384,000km), this means 
d 9 
f= ——— = — 
1+1/9 10 
So, the spacecraft must be about 346,000km from Earth, or rather 38,000km from 
Earth (measured from the centers of both the Earth and the Moon). © 
® Remark: While the attraction of the Sun is quite strong, it is completely offset 
by the centrifugal force (the Earth, the Moon and the spacecraft rotate around the 
Sun at about the same speed). ® <<< 
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2.5 Algebraic equations of higher degree 


In applied mathematics, algebraic equations of higher degree(s) such as 
n 
{ays » a,x =0 
k=0 


are not uncommon. The solutions of such equations are called the “roots” or 
“zeros” of the equation. Equations of degree three and four can be solved by 
means of rather complicated formulas without too much effort and can even 
be solved exactly with a detour via the complex numbers. 

Without going into too much detail or even using any “complex numbers” (see 
B), we only want to collect the most important facts on algebraic equations. 
The theory of these equations kept mathematicians busy over many centuries. 
The most important theorem was given and proved by C.F. Gaufs in his PhD 
thesis: 


The Fundamental Theorem of Algebra: An algebraic equation of degree n has n 
solutions when so-called “multiple solutions” and also when so-called ‘‘complex 
solutions’ are counted. 


® Remark: The German mathematician Carl Friedrich Gaufs (1777-1855) is one of 
the most important mathematicians of all time. He was involved in many fields of 
mathematics and its applications, especially in astronomy. Even up to the present, 
many of his elegant methods have remained unsurpassed. ® 

For us, an important consequence of the fundamental theorem of algebra (see 
Section 2, page 41) is the following: The number of real solutions is at most 
n, but it may reduce by multiples of 2. Multiplicities also count. 


Fig. 2.33 images formed by refraction 


>>> Application: refraction at a plane (Fig. 2.33) 
To determine the “break point” of a light ray at the planar interface between 
two “media” (a water surface, a surface of a thick glass plate, etc.), we must 
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solve an algebraic equation of degree 4, like f(x) = a4a*+a3 2° +a 2? +a, 0+ 


ag = 0. Two or four solutions are real. However, only one of these is relevant. 
It can, above all, be calculated accurately and efficiently by a computer. 


Fig. 2.34 right: heavily skewed beneath the surface 


® Remark: Consider the “elevation” of the pool bottom in Fig. 2.33. Both the photo 
on the left and the computer calculation on the right have a constant value as their 
depth! The right image shows four (!) dolphins that swim right above each other. 
The top dolphin is barely recognizable in the image. Also if we look at Fig. 2.34, the 
animals are “undistorted” above, but heavily skewed beneath the surface. ® <<< 


>>> Application: ray tracing 

Realistic computer images, to which we are now accustomed, are often ge- 
nerated by ray tracing programs. The “scene” is composed of “primitive al- 
gebraic building blocks”, such as flat polygons (triangles), spheres, cones, 
cylinders, or annular surfaces (tori). Lines of sight are considered by the in- 
dividual pixels of the screen (“picture elements”). They either disappear “into 
nothing”, or else they meet such blocks. The identification of all possible in- 
tersections of the line of sight is done by means of the rigorous and efficient 
method of solving algebraic equations up to degree 4. If several intersections 
exist, only the foremost — thus seen as the visible — is considered. <<< 


In general, when dealing with algebraic equations of a higher degree, we will 
have to make do with approximate solutions, as will be the case with non- 
algebraic expressions of the form f(x) = 0. We deal with this issue in the 
differential calculus section (Newton’s method, Chapter 6). 

Sometimes the degree of an equation decreases. This occurs especially when 


we can use a substitution of the form u = x7, u= 2°, etc. 


>>» Application: reducible equation of degree six (Fig. 2.35) 
Find all real solutions of the equation 


o® -27°+1=0. 
Solution: 


By means of the substitution u = x 
equation: 


3 we obtain the easily solvable quadratic 


w-2ut+1=0> u1,2 = 1 (double solution!) 


2.5 Algebraic equations of higher degree 43 


Since u = 2° = 1, a cubic equation arises. The latter has one real solution. 
Overall, our equation of degree six has, therefore, a — doubly counted — real 
zero. This solution might not be found by a program that works numerically. 
It will only estimate zeros (up to a certain precision) within intervals where 
the function changes sign (or vanishes). Thus, in this special case, it may 
find the only zero, but it will do so only by chance. << 


Sometimes trivial solutions of a higher degree equation split off so that the 
degree of the equation can be reduced. Under these circumstances, the re- 
maining solutions can be calculated exactly. 


Fig. 2.35 f(z) =2°-22° +1 Fig. 2.36 Cartesian quadrifolium 


>>» Application: algebraic curve of degree six (quadrifolium) (Fig. 2.36) 
Descartes’s quadrifolium is given by the equation 


(x? + y*)? = 27 2y?. (2.22) 


(An equation in x, y where neither variable is isolated like y = f(x) is called 
an implicit equation.) Find the up to six points of intersection with a straight 
line through the origin. 

Solution: 

In the general linear equation y = kx + d, we set d = 0 since the line passes 
through the origin (0/0). Then we insert into this Formula (2.22) and get 


[a2(1+k7)]? = 27k? at = (14 kb?) 0 = 27k 24. 


Now, we can cut out «* if we assume x + 0. Then, we have only a purely 
quadratic equation whose solutions can be directly written as 


27 k? k 
(1+k*)? 2? = 27k* > a19=4 ar me ; 
(1 +k?) JA + k2)3 


The value x = 0 counts in the algebraic sense as a four-fold solution. In fact, 
one can see in Fig. 2.36 that the curve meets a general line through the 
origin at two points S; and S2 (with the x-coordinates x; and x2) and it 
additionally intersects at the origin four times. 
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Fig. 2.37 generalization of the quadrifolium by varying the kinematic generation 


® Remark: The quadrifolium can be generated kinematically by a so-called planetary 
gear in which arod FR uniformly rotates about a fixed origin F’, while a second rod 
RC — of the same length — (Fig. 2.36) rotates about R with triple angular velocity. 
The same motion is generated by rolling a circle R on a fixed circle F'. The ratio 
of the radii is 1: 4. This generation allows a generalization by varying the ratio of 
the radii. This produces a trefoil, quintifolium, etc. The trefoil is a curve of degree 


four. The degrees of the other curves are significantly higher. See also page 542. ® 
ddd 


>>> Application: cubic curves (Fig. 2.38) 

The folium of René Descartes (1596-1650) is best described in the coordinate 
system that bears his name (Cartesian). Its cubic equation is a + y? = cary. 
The intersection with rays y = kx through the origin lead to simple linear 


equations: 23(1+ k?) = cka? > x = ae 


“hh 


Fig. 2.38 Two “classics” among cubic curves: The folium of Descartes and the right 
strophoid. Both curves are described by algebraic equations of degree 3. 


The strophoid is easily constructed: On each ray through the point A(-a/0), 
one draws those points which are at distance d from the ray’s intersection 
M = (0/d) with the y-axis. This yields two points P and P* on the curve. 
The curve has a double point at the origin, and its tangents there are the two 
angle bisectors of the coordinate axes. The algebraic equation of the curve is 
(a-y)y? = 27(a+ x). Again, points on rays can be found via a linear equation. 

ddd 
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>>» Application: the Delian cube duplication problem’ (Fig. 2.39) 
Legend has it that the Delphic oracle once suggested that the volume of 
the cube shaped altar in the temple to Apollo should be doubled. It was, 
therefore, necessary to determine the side lengths of two cubes whose volumes 
formed the ratio 1:2. The solution is, of course, the cubic root of 2: 


¢ -2a° > ax-a- v2. 


A remarkable approach is attributed to Menaechmus, who presented the 
problem to Plato’s school. In his approach to the problem, he used curves that 
would later be classified by Apollonius of Perga (262-190 BC) as parabolas. 
We consider four cubes altogether with side lengths a, x7, y, and 2a. Let the 
volume of the second cube be twice that of the first, the volume of the third 
be twice that of the second, and that of the fourth be twice the volume of 
the third. 


Fig. 2.39 left: Menaechmus’s method, right: intersection of quadratic parabolas 


Then, from a: 2 =x: y=y: 2a, we obtain the system of equations 
x? = ay, y? = 2ax. 


These two equations can be represented graphically as parabolas through 
the origin whose remaining point of intersection yields the side length of the 
desired “intermediate cube.” By manipulating the equations, the system can 
be interpreted as the intersection of a circle with an equilateral hyperbola: 


x+y" = 2ax + ay, x — y” = ay - 2ax. 


In each case, it amounts to a solution that the classical Greeks might have 
deemed inelegant. a 


4G. Glaeser, K. Polthier: Bilder der Mathematik, Springer Spektrum, Heidelberg, 37% edition 
2014. 
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2.6 Further applications 


This section contains exercises related to the previous sections. They are 
mostly independent of one another and can be partially skipped without 
affecting one’s comprehension of the concepts. The examples are somewhat 
more complex than the previous ones. The reader is encouraged to consider 
at least the comments on the results in this section. 


>>> Application: How often do the hands of a clock overlap? 


Solution: 

This problem can be solved quickly (otherwise you have to give a lavish ar- 
gument): We start at exactly 12 noon, where both hands overlap. After a full 
turn of the minute hand, the hour hand has performed 1/12 (30°) of a full 
rotation. When the minute hand has rotated 12/11 of the full angle of rotati- 
on, then the hour hand has rotated 1/11 (almost 33°) of the full revolution, 
so that there is coverage again. Then, over a period of 12 hours, the hands 
will have overlapped 12/(12/11) times, that is, exactly eleven times. 


Fig. 2.40 clocks that drive each other 


® Remark: The same question is significantly more challenging to solve when two 
clocks that move at different speeds are linked via hinges on their minute hands 
(Fig. 2.40) and the whole installation is left to move on its own... ® <<< 


>>> Application: counter-directed motions 

A train with an average speed of c, drives from A in the direction of B 
(distance AB = d). At B, a train starts At hours later and drives with an 
average speed cz towards A. When do the trains meet? 

Solution: 

Let x and y be the respective distances that the two trains have travelled 
when they reach their meeting point. Thus «+ y = d. Furthermore, let t be 
the time that it takes for the first train to reach the meeting point. Then, 
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x = tc; and y = (t- At)cg. This results in tc, + (t - At) cg = d, and thus, 
d+Atcg 
———— << 
Cy + C29 


>>> Application: rock erosion (Fig. 2.41) 

The striking Table Mountain with an altitude of 1,087m is a landmark of 
Cape Town (South Africa). It is one of the oldest mountain ranges on Earth. 
Its age is estimated to be an impressive 600 million years (so, it is almost 10 
times as old as the Alps). It is believed that the peak was over 5000 m at its 
birth. What was the average erosion per year? How high was the mountain 65 
million years ago, i.e. in the period when the Alps emerged and the dinosaurs 
became extinct? 


Fig. 2.41 Table Mountain in Cape Town with its typical “tablecloth” 


Solution: 

height difference 5,000m-—-1,000m = 4,000m = 4-10°m 

time difference 600 million years = 6- 10° a 

erosion per year 4107 = 40-10" m 26.7: 10-6 = 6.7: 19-3 
The erosion in 65 million years (6.5-10’a) is then 

6.7- 10° - 6.5: 10%a * 44-10! m = 440m. 

The mountain was 440m higher 65 million years ago, so a little over 1,500 m 
high. ddd 


>>> Application: if the Antarctic ice melts ... 

The Antarctic (the mainland has a surface area of about 12 million km?) has 
only relatively recently become frozen — but now it is all the more so! The 
major part of the fresh water on Earth is bound in its ice sheet, which is 
about 2 km thick on average. By how much will the sea level rise if the ice 
melts completely? 

Solution: 


We will set the unit to kilometers. The volume of ice on the mainland is 
24-10°km?. When ice melts, it loses about 10% of its volume, so that about 
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Fig. 2.42 the Antarctic and one of its inhabitants — the emperor penguin 
21-10°km?® remains. 


The Earth has a surface area of about 500-10°km?, of which 70% x 350- 
10° km? is water. Let A be the difference in the heights of the sea level, then 


21 
350-10°km?- A = 21-10°km? > A x eee 0.06 km = 60m. 


® Remark: 60 m is quite a lot. With this rise of sea level, whole groups of islands 
would disappear, large parts of Florida would be flooded, etc. The emperor penguins 
probably would die out — not because the temperature would become too high, but 
because once again — as before - mammals and reptiles would start to live on 
the Antarctic. These mammals and reptiles would then steal the eggs of emperor 
penguins and eat their defenceless pups (see Application p. 421). 


Fig. 2.43 The last glacial maximum (* 24,500 BC): Vast ice sheets covered large 
parts of northern Europe, North America, Siberia, and also parts of the southern 
hemisphere. 


Conversely, the sea level rose to this level of 60 meters about 15 million years ago 
(because the Antarctic was free of ice). During the ice ages that have occurred several 
times in the last 100,000 years (where, for example, thick ice was superimposed over 
central and northern Europe), the sea level was 125 m (!) lower than today. Today 
divers in Southern France have found underground entrances of caves where Stone 
Age people lived! The low sea allowed a small group of people to walk across the 
Bering Strait from Siberia to Alaska. During that time, they had more than 22, 000 
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years to migrate to the two American continents (they joined together only 3 million 
years ago). ® <<< 


>>> Application: The sun dies out! 

Our sun is a giant nuclear reactor where hydrogen fuses into helium. Here 
4.5 million tons of mass per second are converted into energy. The sun has a 
mass that is 332,000 times the mass of Earth (Application p. 91). How long 
would it theoretically take to use up the total mass of the Sun? 


Solution: 

We try to get an idea of the given magnitudes by relating them to more 
familiar objects: The loss of mass per second would be — “downscaled to 
earthly conditions” — 1/332,000 of 4,500,000 tons, i.e. 13.6 tons per second. 
Let us reduce further down to a ball with a diameter of only 1 m. The Earth 
has a diameter of about 13,000km = 13-10° m. Our nickel-iron sphere with a 
diameter of 1 m is about 1/(13- 10°)? of the mass of Earth, and per second, 
it would lose 13.6 -10%kg/(13- 10°)? of its mass, which is in milligrams 


13.6- 10° mg 


= aan ge Oe 10°? mg. 


A year has approximately 30 million seconds. Thus, the unit sphere loses 
0.006 - 107° - 30- 10° mg * 0.2: 107? mg 


every year or five milligrams every 5,000 years, or 1 kg every five billion years 
— that is, about three tons of deadweight. 


® Remark: Studies have shown that the Sun has a 10 billion year lifespan of which it 
has already elapsed half of its time. Regarding the loss of mass via nuclear fusion, the 
limited lifespan of the Sun is not obvious: This proportion is practically negligible! 
Firstly, the limited lifespan is given by the uranium component of solar hydrogen, 
of which it only makes up about 70%, and secondly, only in the innermost part of 
the Sun are the temperatures high enough to propel nuclear fusion. Thus, only 10 
to 20% of the solar mass is available for fusion of hydrogen helium. ® 


® Remark: The “fading” of the Sun as the source of all life is an old phobia of many 
indigenous people. Mayans and Aztecs in the historical parts of Central America 
made human sacrifices as a result of such fears! @ daa 


>>> Application: diameter of a molecule 
An amount of 12g of carbon contains 


Na = 6.022 - 107° (2.23) 


of C molecules. This number is called the “Avogadro” constant (formerly 
Loschmidt’s number). 
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Fig. 2.44 the Sun dies out ... Fig. 2.45 water molecules 


Water molecules consist of 1 oxygen atom and 2 hydrogen atoms (angle 
105°). The same number of molecules is contained in one mol of any substance 
(mol= molecular weight in grams). 

Water (chemical formula H2O) has a molecular weight of 18 since each water 
molecule contains two hydrogen atoms (each with an atomic weight of 1) and 
an oxygen atom (an atomic weight of 16). 

So, in 18¢ of water, there are N4 water molecules. 

One can estimate the diameter of a water molecule. 


Solution: 


18 g of water has a volume of 18cm?. These fit into a cube with a side length 
of ¥/18cm x 2.6cm. 

Suppose we pack every single water molecule into an “elementary cube” of 
side length d. Then, Ny of these cubes would fit into said cube with a side 
length of 2.6cm. Thus, we have 


/18cm? / 18 / 180 
3 = 3 Ls _ 38 3 
d°? Nag =18cm’ > d 7 6-1023-@ 6-1024°™ 
3. g= 8) 2 aie 3 40 ona 40 me 03.10 
= ipcm * cm = m=0. m. 


We have shown that the diameter of a molecule is less than one nanometer. 


@ Remark: According to the above result, we can say that, in 2g of hydrogen, there 
are N4 hydrogen molecules Hp. From this, one can determine the mass of a hydrogen 
atom, H, which used to be the so-called atomic mass unit until 1961 (since then, 
4 of the atomic mass of !?C’ has been used as the legal unit): ® 


® Remark: 
1 2 


“* 2 6.022-102 


5 8 = 0.166 -10°* g = 1.66- 10-7" kg. 
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Accordingly, a helium atom has a mass of » 4u, one carbon atom has 12u (since 
exactly 1961!). An oxygen atom contains » 16u, a gold atom » 197 u etc. — each of 
the corresponding values are located in the “Periodic Table of Elements”. The mass 


: uw 9-103! kg 
24 


of the electron is (almost) negligible: An electron has a mass of 


(note that this is residual mass). ® <<< 


>>> Application: continental drift 

Pangea broke up into two roughly equal continents called Laurasia and Gond- 
wana roughly 280 million years ago. About 150 million years ago, Gondwana 
further broke and has been drifting apart since then — as seen, for example, 
in the movement of South America and Africa away from each other. What 
used to be a daring theory (Alfred Wegener, 1912) can be confirmed through 
measurement today. Assume that the drift velocity v was reasonably con- 
stant and calculate how far Africa and South America drift apart every year 
or how far they drift apart every second when their current distance is about 
5,000 km. <<< 


x 
Fig. 2.46 drifting continents, tectonic plates 


Solution: 
_5,000km  5-10°m = 3-107m_ = 3-10?7m_— 3-10? m_—i1-10-°m 


150:106a 150-10%a a 365-24-3,600s 3-10%s S 
We have a drift velocity per year of 3 cm, and as a result of Application p. 49, 
we know this involves about three water molecules per second, given that we 
know their diameters. 


® Remark: We can measure the original distance using the drift velocity of 38cm per 
year by looking at the current positions (currently at 1m accuracy) of many places 
in Africa or South America over large time intervals by means of GPS (Application 
p. 13, Application p. 363). Then, the drift velocity is obtained by calculating the 
average, and this becomes more accurate given the more measurements one has 
taken and the longer the time interval is (for example, 5 years). ® 


>>> Application: a global conveyor belt as an air motor 
The enormous amount of 20 million cubic meters of salt water (which is 
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almost half the amount of freshwater on Earth) flows every second, most 
of it at great depths. It flows in streams at a rate of 1 to 3 km per day 
repeatedly around the world (Fig. 2.47). In certain places, for example, in 
the Gulf of Mexico, this stream is “caught” and rises so that it heats up, 
quickly reaching the polar regions (the Gulf Stream!), where it cools down 
rapidly and descends again. Looking at the six projections in Fig. 2.47, the 
question arises: How long will a full cycle take? 


Fig. 2.47 thermohaline circulation, colloquially shown as a global conveyor belt in 
different views of the globe. Images using a single view of the globe can easily lead 
to misinterpretations of the currents (particularly around the Antarctic). 


Solution: 

An important preliminary remark: The circulation belt is usually depicted on 
a “rectangular projection” where each estimation of length — especially near 
the poles — leads to incorrect results.The sphere is doubly curved and cannot 
be unfolded distortion-free into the plane (Application p. 321). The multiple 
belts encircling the South Pole (bottom right) is clearly visible in the image. 
It is in total no more than a huge loop in the Pacific (bottom left). If we 
estimate the length of all flows thoroughly, then the length is roughly three 
times the Earth’s circumference (120,000 km). Assuming the flow covers a 
distance of 1 to 3 km daily, then assuming a yearly distance of 600 km — just 
to have a “nice number” — the cycle lasts for 200 years. <<< 


>>> Application: /ens power 
One can use Formula (4.29) to obtain the formula for the focal length of a 
thin biospheric lens (Fig. 2.48). 


2.6 Further applications 53 


Solution: 

Formula (4.29) applies to both spherical surfaces (radii r; and rg, with re- 
fractive indices ny =n and ng = 1/n, = 1/n) where Formula (4.29) is: 

1 n n-l 


=4—5-—_—, thug, 
g Ob Tr 
1 ual i of 1/n-1 
@ 2222" amd 2) == i Ae 
oy oT G92 be rg 


Now, we can connect “in series”: The image width b, of the first refraction is 
the object’s distance, gz, and the result for the second refraction is the image 
distance b = bz (negative! ): 


Fig. 2.48 aspherical lens: two slightly different curved spherical surfaces 


1 in _lfn-1 > (3) ed 
—by b T2 by b r2 
If we add (1) and (3) and set gi = g, then we obtain 


1 1 1 1 
ms =(n-1)( ). 


b iil T92 


Objects at infinity (g = co) are mapped to the focal point (focal length f). 
With 1/g = 0, we get 


1 1 1 
maar 


Numerical example: Setting r; = 16cm, rg = 8cm, and n= 1.4 results in 


( 1 1 ) -0.4 0.025 2.5 


1 
=(1.4-1)- 
‘i 
The value denoted by 1/f (where f is measured in meters!) is called the lens power 
and it is expressed in diopters. Our lens has —2.5 dpt (a suitable power for a spectacle 
lens made for near-sighted people, Fig. 2.48 on the right). 


f =-0.4m. 


16cm 8cm 16cm cm m 


® Remark: The refractive power of the cornea is normally about 43 diopters (dpt) 
(focal length f » 2.33cm), the refractive index of the lens is about 19 diopters. 
Visual defects are expressed in positive or negative deviations. The emmetropic eye 
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has a total of 65 diopters (focal length f x 1.54cm). Its value is not determined 
by combining the power of the lens and the cornea as discussed. With hyperopia, 
the rays meet behind the fovea centralis (with a positive deviation); with myopia, 
they meet just before (with a negative deviation). The cornea has a refractive index 
that can be compared with that of water. Therefore, underwater light from the 
cornea is hardly broken. But because the lens in these changing conditions cannot 
compensate enough, our vision becomes blurry when underwater. ® <<< 


>>» Application: depth of field (DOF) for a photographic lens (Fig. 2.51) 
Derived from the lens equation in Application p. 16, 7 7 wt (Formula (2.3)) 
follows with b=nf 

g= — f. (2.24) 


Let us imagine the lens from Fig. 2.9a built into a camera lens. Behind 
the lens, the photosensitive layer will be perpendicular to the optical axis. 
Because of the lens equation, the only points that appear in sharp focus are 
those that lie in the plane y parallel to the photo plane. Assume that. for 
b, a tolerance of t f is allowed. (In general, t is considered to be very small, 
e.g. t= 0.01). What is the “depth of field” in this case? How far is the object 
allowed to deviate from y? 


Solution: 
We have nf-tf<b<nf+tf. The extreme image distances 


b, = f(n +t) and bo = f(n-t) 


correspond, according to Formula (2.24), to extreme object distances 


d 


n+t n—-t 


f, Ge at 


Then, the difference gz — g; equals the depth of field 


—t +t 
2 err reeaee E 


We try to simplify the expression in parentheses by finding the common denominator 


N=(e=t=D(n4t-1)=[(m=1)=t][@=1) 44] sm=1)7 =P. 


The numerator Z is then Z = (n-t)(n+t-1)-(n+t)(n-t-1) = 


= (n? -#?) -(n-t) -[(n?-#?) - (n+ t)]n? -? —n+t-[n?-# —n-t] = 2t 


and we have s = & fs maya f. If t is small, then t? is much smaller (for example, 


t = 0.01 => ¢? = 0.0001). Thus, we can write 


2 
° (= 1)?” 
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The result should not be interpreted rashly by assuming that a better depth of field 
is a larger one, i.e. the larger f, the larger s. Both ¢t and n depend on f. Let us 
replace tf by to (where to is a constant value representing time). From the lens 
equation, we calculate 


1 g-f b g 


>n >n-l= 


11 
i 2 fe f g-f g-f g-f 


Then, we get 


and recognize: 


The smaller the focal length f, or the further the object, the larger is the DOF. 


Fig. 2.49 wide-close shooting vs. telephoto shooting 


Numerical example: We want to take a picture of an object that has a distance of 
1.50 m, once with a wide-angle lens (f; = 30mm) and a second time with a telephoto 
lens of (fz = 150mm). 
Let g = 1,500 mm, i.e. 


g/fi =1,500mm/30mm = 50, g/ fo = 1,500 mm/150 mm = 10. 
For the depth of fields, we have 


S$, = 2 tp49?, 89 = 2to9? > 81: 89 8 30:1. 


Thus, we see that the telephoto lens can have a sharp image only with a much 
smaller depth range. Yet, you have to remember that such a lens allows us to 
capture details that might be displayed in sharp focus but they only appear tiny. 
Enlarging a section of the corresponding image will mostly lead to a loss of image 
resolution. 
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Fig. 2.50 depth I Fig. 2.51 depth I 


® Remark: The lens equation is only accurate when the incident light rays do not 
deviate too much from the optical axis. This environment is called Gaussian space. 
The depth of this field can be increased very effectively by choosing a large aperture. 
Pictures in the macro area are very critical because they are mapped very close to 
the simple focal length. Fig. 2.51 shows two fighting male stag beetles (Lucanus 
cervus). Here, the depth could be achieved only through the widest aperture — this 
one must either use an ultra-light-sensitive film or a flash. Nevertheless, the foot 
joints in the front and rear area are blurred. In contrast, the blur of the background 
in Fig. 2.50 (garter snakes) — due to the small aperture of 2.8 and the short distance 
as a result of the short exposure time 1/250s (Application p. 70) — is quite desirable. 
The background should be completely neutralized. 
Intentional blur is an important design element in photography. In this specific case, 
it was important to capture the snakes’ eyes and tongues in sharp focus, so that the 
viewer’s attention is drawn to these elements of the photograph. 
Even the human brain works in this manner, with the eyes acting like an “external 
branch”. One could say that we have the advantage of a “selective perception”. ® 
ddd 


>>> Application: How long does an electron flash expose? 

The built-in flash of a DSLR cannot be arbitrarily “synchronized”. One can 
illuminate only with relatively long exposure times (about 1/250 sec). Nevert- 
heless, images that should actually be blurry may be engraved and appear 
sharper than preferred. Guess the actual exposure time by the flash of the 
pictures shown in Fig. 2.52. 

Solution: 

A ball (depicted in the middle of Fig. 2.52) is launched by a spring to a height 
of approximately 1.2m. At a height of 40 cm, the ball has just enough speed 
to make the remaining 80 cm. 

With Formula 2.19 (Application p. 38), we deduce the instantaneous velocity 
of the ball v = /2gh » 4m/sec = 4,000 mm/sec. During the time the picture 
was taken, the ball barely moves — perhaps 1 mm. This means that the flash 
exposes approximately 1/4,000 of a second. So, under certain circumstances, 
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Fig. 2.52 left: the flashed (crisp) photo; middle: the test; right: without flash 


one can “freeze” the blazing fast moving wings of a bumblebee (Fig. 2.52, 
left). 


® Remark: For comparison: 1/250 seconds (in sunlight) is definitely not sufficient 
to depict the wings of a bee in sharp focus (picture right). In direct sunlight, flash 
cannot be used to “freeze”: The residual light during the synchronization time leads 
to the exposure of the film. Here, you have to work with an external and relatively 
expensive flash. 

Digital cameras have no problem with flash synchronization, because they require 
no mechanical motion of a mirror. ® <<< 


>>> Application: the product of two pencils of rays (Fig. 2.53) 

A straight line a rotates about a fixed point A(0/0) with a constant angular 
velocity of 1 while a straight line b rotates about a fixed point B(4/0) with 
a proportional angular velocity of a) -1, b) +1, c) 2. Consider the locus of 
all points of intersection. 
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Fig. 2.53 “product” of two pencils of lines 


Solution: 

The overall calculation of all points is, of course, done by computer. Here, I 
will just say how to “draw up” the line equations: A straight line through the 
origin O with an inclination angle a to the x-axis has the equation y = ki x 
with k, = tana (or in the usual notation, of -k; x+y = 0). A straight line 
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through the point B(4/0) with the inclination angle 6 and kz = tan @ has the 
equation —kz2+y = —4k 9. The principal determinant of our (2, 2)-system is, 
ky 1 
ko 1 
be excluded for the moment (since the lines are always parallel in this case). 
The other two determinants are 


therefore, D = = ky — ky. It vanishes if k, = ko, a case which shall 


0 1 
-Aky 1 


—ky 0 


De=| ky Ake 


|= ato and Dy =| [= 4th 


Ako /fhike 
ko-ky! ko-ky )* 


Fig. 2.53 shows some specific solutions. In the general case, an equilateral 
hyperbola is defined as being generated by two indirectly congruent pencils 
of lines (Fig. 2.53a), a circle by two directly congruent pencils (Fig. 2.53b). 
If the ratio of the angular velocities equals 1 : 2, the locus of intersection 
points is a cubic curve (Fig. 2.53c). If the two lines start at the same position 
with the ratio of the angular velocities equalling -1:1, then the hyperbola 
collapses to the bisector of AB. If the ratio of angular velocities is 1:1 and 
the lines are parallel in the beginning, then they remain parallel. If the ratio 
equals 2: 1, then the point of intersection traces a circle (the central angle is 
twice the angle of circumference — see Fig. 4.52). <<a 


Thus, the intersection of the lines is S' ( 


>>> Application: supersonic speed in free-fall 

Felix Baumgartner’s stratosphere dive in October 2012 helps us find out how 
long one must fall within a (near-)vacuum in order to break the sound barrier, 
and it helps us to figure out the length of the distance travelled. 


Solution: 

Properly formulated, the problem is easily solved: The acceleration of gravity 
(1 g) is approximately 10 meters per second squared (at an altitude of 40 
km it is still almost the same). This simply means that the instantaneous 
velocity increases every second by 10 meters per second. After 32 seconds, the 
diver is, therefore, able to reach 320 m/s, which corresponds to the speed of 
sound at about —20° Celsius (the speed of sound is dependent on temperature 
and increases by about 6 m/s given each 10° temperature increase.) Since 
the uniform acceleration is the average speed — equal to half the maximum 
speed, i.e. 160 m/s, the distance travelled is thus 32-160 = 5,120 meters. 


® Remark: If one theoretically accelerated (by means of rocket-power) for one year 
(that is 30 million seconds) with 1 g, one would reach 300,000 km/s — this is the 
speed of light. However, this is purely theoretical, because if you spin this idea 
further, you will reach superluminal speed ... 
Realistically, however, just about 10 g (100 m/s speed increase per second) is beara- 
ble! At this rate, in just under two minutes, one could reach the necessary 11.2 km/s, 
in order to be able to escape the Earth’s gravitational field (Application p. 403). ® 
ddd 
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>>> Application: A body brakes. 
A body (for example, an automobile) moves with velocity vp and should be 
brought to a standstill within d meters. What is the average deceleration a? 
Solution: 
For the instantaneous velocity v, the following applies in the case of a uniform 
deceleration a: 

v= —at. (2.25) 


For v = 0, this results in ¢ = “. Inserting this variable into the formula for 
the distance . 
s=vot— ah (2.26) 


one obtains 


Vg arvo\2 ve 
d= VO u ( =) = . 
a 2a 2a 
From this, one can gather that the length of the braking distance depends on 
the square of the output speed. Thus, we obtain for the average deceleration 


2 
20) 


Soa (2.27) 


a 


>>> Application: bungy jumping (Fig. 2.54) 

Bungy jumping (also “bungee jumping” or “bungi jumping”) is an “invention” 
of the inhabitants of New Zealand. In this daring activity, a person jumps 
from a fixed spot, at a great hight, to which they have been tied by an 
elastic rope. Until the rope length Lg is reached, the person will fall freely. 
After that, the elastic rope will produce an increasing braking effect until the 
maximum cable length of Dmaz is reached. How long does the bungy jumper 
dive in absolute free fall? 

To give a specific example: A jump from the Bloukrans River Bridge, South 
Africa (216 m height) is currently the longest possible dive: Lp = 90m, Lmax = 
170m. 

Solution: 

The modified Formulas (2.25) and (2.26) for the vertical fall (initial velocity 
vo, current velocity v, acceleration due to gravity g 9.81m/s”) are: 


v=uot+gt, s=upt+ SP. (2.28) 


When horizontal bounce applies, then vp = 0 (since there is no downward 
velocity component). The formula is valid until the time Ty, when the cable 
is tensioned. Then, we have 


{21 
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Fig. 2.54 bungy jumping 


The corresponding velocity is 
Vo =0+g9To = / 2gLo. (2.29) 


In our special case (Zo = 90m, Lmaz = 170m): 
Ty % 4.288, Vo x 42m/s » 150km/h 
® Remark: From the time To, g is reduced by the (ever-increasing) cable delay. The 
speed continues to increase until the rope force exceeds the bungee jumper’s weight. 
This occurs when the rope reaches its maximum length and the jumper hangs at 
rest (i.e. when the rope force and gravity are in equilibrium). 
We will examine the rather complex conditions in the final phase of the flight in 
more detail by means of differential calculus in (Application p. 274). Here, we only 
determine the (initially not very telling) average deceleration a and use it in the 
result of Formula (2.27) of Application p. 59. Using Vo instead of up, the braking 
distance is the rope expansion d= Dnax — Lo 

Vo glo 


a 2 Lmat and Lo) 7 Limax _ Lo 


The average deceleration of the rope b is naturally greater than g, because the total 
acceleration consists of still-operating gravity and the counteracting deceleration of 
the rope. 

For example, let Lo = 90m, Lmagz = 170m: 


ae ll > bw w2QIMy. 
Ss § 


The average deceleration 6b of the rope is naturally not constant but it is, according 
L-L 
° If bis 2.1g on 


0 
average and increases linearly from 0 to a maximum value bnaz, then bmax will be 


to Hooke’s law, proportional to the current cable strain ¢ = 


approximately equal to twice the average acceleration, i.e., about 4g. ® da 


>>> Application: gravity through freefall 
During free fall, you are — at least in a vacuum — weightless. A person who 
jumps out of an airplane barely reaches speeds above 50 m/s, because air drag 
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and weight are balanced. A plane can overcome the barrier of air resistance by 
motor force. How long can one thus create weightlessness with an airplane? 


Solution: 

A plane cannot rise arbitrarily high because it takes a certain air density to 
fly. At an altitude of about 10 km, it could plunge towards the ground “like 
an eagle”. Through proper use of the engines, constant acceleration can be 
achieved just as for a body in vacuum. At a certain instant, the pilot has 
to end the plunge and must not accelerate any further. Team and materials 
should not be overtaxed. From Application p. 59, we know that acceleration 
and deceleration have to take the same amount of time in order to reach a 
deceleration of 1 g. To stay on the safe side, you could thus accelerate from a 
height of 10 km to a height of 6 km and then be located in level flight again 
at a height of 2 kilometers. This would correspond to a free fall of 4,000 m. 
With 


2 
s=$P st= #5 ~~ /8008 8 288 
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we compute 28 seconds of weightlessness. The maximum speed would be 
v =gt* 280m/s, thus, below the speed of sound. 


® Remark: In fact, aircraft are used to perform physical experiments in weight- 
lessness. An Airbus 300 is used in practice. Here weightlessness means achieving 
the ascent phase of a “parabolic flight”: the machine faces straight up with an in- 
creasing speed and is then forced into an inverted flight parabola — this is where 
weightlessness occurs (the acceleration of gravity acts by reversing the parabola of 
the achieved speed, and so we get the same path acceleration along the opposi- 
te curve of the parabola). After the culmination point of the parabolic flight, the 
conditions in free fall are shown as described. @ <<< 


>>> Application: squaring the circle 

A square (in blue) is to be increasingly “rounded” so that the circumference 
or the area remain the same, as with the square shown on the left of Fig. 2.55 
and even more with the one shown in the center of Fig. 2.55. In particular, 
the radius of the limit circle is to be determined so that the circumference 
or the area are equal to those of the initial square. 


Solution: 

Both in the circumference, as well as in the area of the circle, the circle con- 
stant 7 is utilized. One can, therefore, deliver no exact constructive solution 
to the problem, but only one through calculation: 

Given the side a of the square, its circumference is U = 4a and its area is 
A= a’. With U = 2rr or A =r’, we immediately have the radius of the 
limit circle: 
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For the intermediate circles (tangent distances y and the radius x of the 
rounding circles), we have: 


) 2 
da = dy +4." or yP + dary +4 = a”. 


T 
With equal circumference, we have the linear condition y = a- Pa with 


equal area, we have the quadratic condition y = —2x + \/a? + (4-—7)x?. The 
limit positions are reached at y = 0. 


Fig. 2.55 converting a square into a circle (left: equal circumference; middle: equal 
area; right: an interpretation of Leonardo’s Vitruvian Man) 


® Remark: Leonardo da Vinci is supposedly believed to have found a graphical 
solution for land conversion (squaring the circle). He suggested that the radius of 
the circle should enlarge a thousand times (thus increasing the area to a million 
times larger) and this giant circle would be composed of a million equal sectors. 
Such a sector would be virtually indistinguishable from an isosceles triangle whose 
surface can be transformed into a rectangle, and then easily into a square. For a 
practitioner who is satisfied with a finite number of decimal places, this is a perfectly 
workable solution! In any case, his drawing, which is one of the most famous sketches 
in the world, depicts man inside a square or circle (Fig. 2.55, right). @ <<< 


>>> Application: quickly estimating a large area 

In Kolontar (near Ajkai in Hungary), the dam of a lake broke in October 2010. 
This lake was filled with toxic aluminium sludge. As the news spread across 
the world, it was rumoured (and this information is still available online on 
some websites) that an area of 40,000 km? was flooded. In fact, the flooded 
area covered a surface of only 40 km? (which is still a considerable amount). 
As an ordinary citizen, how do you quickly assess the maximum amount of 
sludge brought by the flood in cubic meters or the realistic surface area of 
the flooded region at a given time? 

Solution: 

A Google Earth image (Fig. 2.56) with the plotted scale shows the dam 
(depicted in orange because at that time it was still filled with sewage sludge). 
The dam can be rapidly approximated by a polygon. Now you can convert 
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Fig. 2.56 Google Earth image with plotted scale (bottom left). The dam is in orange 
and approximated by a polygon. Middle: Estimated conversion of the polygon to a 
square (indicated by a red dashed line) of the same area. 


the polygon “instinctively” and generously to a square, with no claim to great 
accuracy. Taking the scale of the image into account, the following can be 
said: If the square has 500 to 600 meters as a side length, then the area of the 
polygon will be approximately 300,000 m?. If the lake of sludge has a depth 
of 3 meters (a number that was bandied about and appeared realistic), the 
resulting area will be significantly larger as it will be covered by 1 million 
cubic meters of mud. With a sludge depth of 3 cm (instead of 3 m) the surface 
is reduced a hundredfold to 100- 300,000 m? * 30 km?, which is close to the 
actual result. <<< 


>>> Application: all the gold in the world (Fig. 2.57) 

An annual amount of My, = 2,500 tons of gold are produced worldwide. 
The American United States Geological Survey estimates that more than 
80 percent of the current gold production of humanity was produced after 
1900 and amounts to a total of M> = 160,000 tons of gold produced by 
human hands. This is “nothing” in relation to the estimated M3 = 30- 10° 
tons produced throughout the Earth’s crust (http://www.zeit .de/2008/ 
16/Stimmts-Gold). How big would the respective cubes made up of the entire 
gold mass of M1, Mz and M3be? Gold (aurum) has a density of 04, = 19.2 
per dm?. 

Solution: 

We compute with meters and tons. Then 19.2 tons have a volume of 1 m3. The 
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corresponding volumes are thus Vj = 2,500/19.2 130m?, Vi = 160, 000/19.2 
8,300m?, V3 = 30-109/19.2 » 1.56. 10°m?. By extracting the cubic root, we 
get the edge lengths of the cubes: 5 m, 20 m, and * 1,160 m. 


® Remark: Remarkably, there is a seemingly insignificant difference between the 
first two cubes. But the volume increases dramatically with the edge lengths of the 
cubes. This will be dealt with in more detail in the second chapter. © <<< 


>>> Application: a crown of pure gold 

Legend says that King Hiero confronted the great all-round scientist Archi- 
medes with a tricky question, and the answer was a matter of life or death 
for his goldsmith: He wanted to know if his crown was made of pure gold, 
but this should be determined without destroying the crown. Archimedes’s 
answer: He brought a balance beam (Fig. 2.58) to balance between a piece 
of pure gold on one side and the crown on the other side. It changed upon 
immersion within water and was not at equilibrium, which spelt bad news 
for the goldsmith ... 
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Fig. 2.58 the gold test of Archimedes: G, : Gg = dg: dy 


Solution: 
Here Archimedes combined two of his most important inventions: the law of 
the lever and Archimedes’s principle concerning buoyancy. Suppose a calibra- 
ted weight of pure gold (weight G1, mass Mj; = G1/01; the density 01 = 04u 
being as given in the previous example). With a volume of V2 and a density 
of 02, the crown has the weight G2 = V2: 02-g. If the beam balance (Fig. 2.58, 
left) is in equilibrium, then G, : Gp = dg: d, from which G2 can be deduced. 
If we now submerse the two weights completely in water, their buoyancy 
forces reduce by about the weight of either displaced amount of water. Thus, 
the respective density decreases indirectly by about the density of water, i.e., 
about 1 kg per dm?: 

1=M,- (01-1) = G1- (01 -1)/01 and G3 = Mg - (a2 - 1) = G2- (02 - 1)/02. 


With the same density 01 = 02, the balance remains in equilibrium. However, 
if @2 < @1, then the buoyancy of the crown is slightly higher, and as a result, 
the crown rises. Equilibrium is reached again only when the bearing point is 
moved to the left by a (measurable) distance of A (Fig. 2.58, right). With a 
little skill, you can calculate 02 from all known values, and it subsequently 
determines a possible silver content in the crown. This works well when using 
a weight G, that is not made of pure gold. aad 
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>>» Application: swarm rules (Fig. 2.59) 

At times, shoaling fish or flocking birds seem to “dance” in the water or in 
the sky. One might be excused for thinking that there is a complicated and 
deliberate choreography behind it. Animals often gather or travel together 
in large numbers. In many cases, this swarming behavior serves as a defense 
against predation. Yet, how can we explain the complex and intriguing mo- 
tions of swarms, as these collections of animals change directions within a 
fraction of a second, split up into groups and then reunite? 


Fig. 2.59 three major rules ... 


Solution: 

One might assume the existence of an “alpha specimen” that determines the 
motion of the swarm. However, how is it possible that this individual always 
stays at the front of the pack? In fact, there is no such leader of the pack. All 
members of the swarm are equal while they are in motion and merely follow 
three very simple rules: 

1. Move in a common direction. 

2. Always keep a certain distance to your neighbours. 

3. If a predator is approaching, escape. 

In moments of danger, the distances between neighbours increase due to 
reaction time. The swarm becomes wider and may even “tear apart”, but as 
soon as the predator has left the scene, the remaining group usually reunites. 
As a virtual shark attacks a swarm in a computer simulation, all individuals 
obey the abovementioned rules. This simulation yields extremely realistic 
behaviour and may, thus, be taken as heuristic “proof” that the swarm rules 
actually exist. What is more, predators are usually distracted by swarming 
behaviour, and the chances of survival are larger for the individual. <<< 


>>» Application: sum of cross-sections (Fig. 2.60) 

Thinkers as far back as Leonardo da Vinci have suspected that, as trees 
branch out into ever more intricate formations, their total cross-section, ne- 
vertheless, stays roughly the same. This apparent rule has been explored and 
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Fig. 2.60 Computer simulations based on different parameters of trunk thickness, 
iteration count, and branching angle. The sum of all cross-sections is constant. 


refined by computer graphics engineers. Let us attempt an analysis based 


Fig. 2.61 African Aloe tree 


on the African Aloe tree (Fig. 2.61). It may not be exact, but it is sufficient 
to imagine the branches as having a locally circular cross-section. If this 
is true, then the following reasoning works relatively well: Wherever a first 
branching occurs, we select the center M of a sphere, which then includes 
the cross-section circles of the branches as small-circles. We can measure the 
radii of these circles. If the largest (lower) circle has a diameter of 1 unit (= 
100%), then the smaller circles have radii of 0.57 (57%), 0.38 (38%), and 0.7 
(70%). We know that the circle area increases quadratically as the radius 
grows linearly. In fact, our result of 0.57? + 0.387 + 0.7? = 0.96 is relatively 
close to our expectation of 17 = 1. It would seem that Leonardo’s rule works 
well in the case of the smallest sphere. Now, we increase the sphere radius. 
The number of branches has grown to 25, which would suggest an average 
cross-section surface of 1/25 (1/5 of the maximum diameter). Leonardo’s rule 
does not seem to apply this time, as the total cross-section seems to have 
grown smaller. This irregularity is even more pronounced when considering 
the third sphere, where the total cross-section is even smaller. << 


3 Proportions and similar objects 


In this chapter, we deal with similar objects and other proportions. The 
insights thus won have many practical applications and benefit the under- 
standing of many natural phenomena. 

In particular, the surface area and cross-section area of a body do not change 
at the same rate as its volume or mass. Once these “relations of scale” have 
been understood, a multitude of interesting conclusions may be drawn. One 
such conclusion is, for instance, that one should be careful when drawing 
comparisons between large and small objects in nature, even if they share 
many visual characteristics. 

It becomes clear that it is advantageous for large animals to be warm-blooded, 
while it is largely impossible for small animals to be so. In relative terms, 
however, smaller animals are significantly stronger. An enlargement of agi- 
le, mostly flying insects immediately changes their properties. The gigantic 
monsters that roam through fictional worlds would quickly fold under their 
own weight. Similarly, superstructures like skyscrapers, bridges, and ocean 
liners have to follow different rules than their miniature models. 

Gravity and attraction play an increasingly significant role as the sizes and 
masses of objects increase, until these forces begin to dominate shapes and 
behaviours on the cosmic scale of stars and planets. A clever use of proporti- 
ons allows us to quickly and efficiently derive astronomical phenomena, such 
as the calculation of orbital times and velocities of planets and satellites. 
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3.1 Similarity of planar figures 


Let us start by defining what we mean by similarity: 


Objects are considered to be similar if they differ only in scale but not in form. 
Angles are identical, and pairs of lengths are at a constant proportion k with 
respect to each other. 


In the plane, all circles are similar to each other — as are all squares, equila- 
teral triangles, etc. In three-dimensional space, spheres, cubes, equilateral 
tetrahedra, etc., are similar to other objects within their respective group. 
In the plane, the following important theorem applies: 


If the lengths of a two-dimensional object are enlarged by a factor k, then its 
area is enlarged by the factor k?. 


Proof: 1. This theorem evidently applies to each triangle where the following holds 
(Fig. 3.1): 
area = $-base length x height. 


If the base line and its height are enlarged by a factor k, then the area is enlarged 
by the factor k?. 


Fig. 3.1 on the area of a triangle Fig. 3.2 triangulated 2D object 


2. Each closed, planar polygon — even with “holes” as in Fig. 3.2 — can be “trian- 
gulated”, i.e. split into triangles. Curvilinear objects can be approximated to an 
arbitrary precision by polygons. © 


>>> Application: Ejinstein’s proof of the Pythagorean theorem 

Almost everyone knows the formula a? + b? = c? for the lengths of the sides 
of a right triangle. There are hundreds of different proofs for it. Particularly 
noteworthy is the proof that the eleven year-old Albert Einstein (1879-1955) 
discovered. 

Solution: 

Einstein imagined a triangle ABC as being composed of the similar (since 
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B 


D 


A 


Fig. 3.3 a remarkable proof 


the angles are the same) component triangles CBD and ACD (Fig. 3.3). 
All three similar triangles (they have hypotenuses a, b, c) can be obtained 
from a prototype with a hypotenuse of length 1 by multiplying its sides by 
the factors a, b, c. Let F’ be the area of this prototype. Since the area of a 
triangle grows as the square of the scaling factor, we have F-c? = F-a?+F-b?, 
and one has only to cancel F' to obtain the Pythagorean theorem. <<< 


The circumference of an object increases linearly (multiplied by the factor k). 
Thus, in all similar objects, we may find relations that are dependent on the 
scale of the object: area : circumference. 

Three examples shall be given: 

1. square: area a?, circumference 4a => area: circumference = re 


2. equilateral triangle: area Bq? circumference 3a => 


2 a 
area: Circumserence = ——~= 
fe 4/3? 


2 a 


3. circle: area 7a*, circumference 27a = area: circumference = $. 


Thus, we conclude: 


When a planar figure is enlarged, the ratio area: circumference increases pro- 
portionally to the scaling factor. 


>>> Application: a strange thread around the equator 

Imagine the Earth as a completely smooth sphere (radius 6,370 km). We now 
wind a thread (tightly) around the equator. Then we extend this very long 
thread by only 10m and lift the thread uniformly from the surface so that it 
is stretched again. How high is the thread? 


Solution: 
The solution is initially surprising: The radius R of the circle (circumference 
Uo = 27R) lifts the thread up to around ae = 1.59m. A circle with radius 


R+ Hye has the circumference: 


i 
U=2n(R+ S=) = Uo + 10m. 
21 


If we work with a factor of k, this is at least plausible: We increase the 
circumference of Up = 40,000,000m at U,. = 40,000,010m. Thus, it is k = 
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re = 1.000000 25. The radius R of the circular tensioned thread increases 


from 6,370,000 m to 6,370,000m-k = 6,370,001.59m. This in proportion is 
very little, but yet it is 1.59m. 


Fig. 3.4 How much land can you cover with a bull hide? 


@ Remark: A thread that became famous in history is the one that, as legend has 
it, led to the founding of Carthage (Fig. 3.4): 

The Numidian king Jarbas unkindly granted the Phoenician Princess Elyssa (also 
known as Dido, see Fig. 3.4) only as much land as could be encompassed by a bull 
hide. Jarbas could not know that Elyssa would cut the skin into thin strips so that 
she could fence off a vast territory ... ® <<< 


>>> Application: enlargement or diminishment with a photocopier 

When a photocopier shows an enlargement by 141%, the surface area doubles 
(for instance, from A4 to A3): 141% x /2-%. At 71% » 1/\/2-%, the surface 
area is halved. <<< 


>>> Application: side length of the parachute by Leonardo da Vinci 
Five-hundred years ago, Leonardo da Vinci sketched the first parachute in 
the form of a four-sided pyramid (Fig. 3.5, left). It might have worked, as 
there is a hole at the top that had a stabilizing effect. The man on the left of 
Fig. 3.5 must still have had a rough landing. If a square base with 7 meters 
on each side brings a body of 70 kg mass to the floor in a reasonably safe 
manner, how big is the base if twice the mass has to be landed safely at the 
same rate of fall? 

Solution: 

The air resistance increases at low speeds linearly with the cross-sectional 
area, i.e. with the square of the side length. We need twice the resistance, so 
the /2-fold side length (10 m). «<< 


>>> Application: shutter speed, apertures, and sensor sensitivity 
In manual exposure photography, the shutter speed (= exposure time) and 
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Fig. 3.5 parachute options 4 la Leonardo 


the aperture need to be configured. The shorter the exposure time, the less 
light reaches the light-sensitive sensor. Thus, if the light conditions are to be 
kept constant, and the aperture number doubled (thus halving the aperture) 
in order to achieve a greater depth of field (see Fig. 2.51), a four-fold exposure 
time must be configured! 

How long, for instance, must one expose with an aperture of 16 if the exposure 
time at an aperture of 8 takes 1/500 seconds? 


Solution: 

An aperture setting of 16 only lets in one quarter of the light onto the light- 
sensitive surface in comparison to the aperture setting 8. It is, therefore, 
necessary to configure an exposure that takes four times as long (1/125 se- 
conds). In digital photography, it is possible to compensate by adjusting the 
sensitivity of the sensor (see the next example). aa 


>>> Application: inferences about brightness 

If the exposure time, aperture setting, and sensor sensitivity are known, then 
the brightness of a scene can be estimated with good accuracy. The object 
in Fig. 3.6 was photographed three times based on the following camera 
configurations: the left photo with 1/200 s exposure, an aperture setting of 
11, and ISO 100; the center photo with 1/160 s exposure, an aperture setting 
of 10, and ISO 400; and the right photo with 1/80 s exposure, an aperture 
setting of 7.1, and ISO 1600. What can be inferred about the brightness of 
the three scenes? 


Solution: 

If we take the interior scene as the reference, then the picture in the shade, 
due to half the exposure time, an aperture setting 1.4 times greater (decrease 
of incident light by a factor of 1.4? ~ 2), and a fourfold sensor sensitivity, must 
have been more strongly lit by a factor of approximately 2-2-4 = 16. 

The scene in the sun — when compared to the scene in the shade — underwent 
an exposure that was 200/160 = 5/4 shorter still, with an aperture setting 1.1 
times greater (1.1? » 6/5 times more incident light), and with light-sensitivity 
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Fig. 3.6 in the sunlight, in the shade, and inside a building 


reduced by a factor of four. Thus, it can be estimated that the scene reflected 
about 5/4-6/5-4 = 6 times as much light as the scene in the shade. It can thus 
be estimated that the leftmost scene was illuminated 6-16 * 100 as brightly 
as the building interior. 

® Remark: If the objective was to picture the toy animal as well as possible, then the 
center scene is clearly preferable: Strong shadows, as in the left picture, may distract 
some viewers from the subject. The sensor sensitivity ISO 400 does not yet produce 
strong noise artefacts, but allows for a higher aperture number, and thus, for greater 
depth of field. ISO 1600 (as in the interior) distorts the colors somewhat more — 
on the other hand, one is often left with no choice when photographing interior 
scenes, and when no photo flash is available. The quality of a photo sensor depends 
significantly on how strongly it produces noise artefacts in low-light scenarios. 
What is true for the light-sensitive sensor also applies, to some degree, to us people: 
We need a “daily quantity of light” for our well-being. Whether in the baking sun, 
or beneath a cloudy sky: We take in at least ten times more sunlight under the 
open sky than in interior spaces. Not even daylight lamps can replace the outdoors! 
What is more, glass surfaces change the structure of light to a significant degree 
— for instance, by filtering out large amounts of UV light. For this reason, it takes 
much longer to get sunburnt in a vehicle where all windows are fully shut. ® <<< 


>>> Application: zooming in as much as possible 

The optical zoom factor is an important technical specification while shooting 
(with digital zoom one obtains no information). Compare the images in Fig. 
3.7 in terms of resolution and the zoom factor (left: 10-megapixel photo with 
a resolution of 2,592 x 3,888 pixels, center: 3-megapixel shot with a high- 
quality HD camcorder together with a bolted teleconverter with a resolution 
of 1,440 x 1,920 pixels). 

Solution: 

By measuring, we note: In the left image, a third of the image width is covered 
by the tower. On the image in the middle, the tower covers half of the width. 
On the image in the middle, the optical zoom factor is 50% larger than on the 
left image (although on the right, the image format 4:3 shows less height). 
For the tower width on the left, 2,592/3 = 864 pixels are available, while, 
in the middle, there are 1,440/2 = 720 pixels. The optical resolution on the 
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Fig. 3.7 three extreme zoom shots. The left figure is a 10-megapixel photo. The 
middle one has only 3 megapixels, but has a 50% stronger optical zoom than the 
left photo. Right: An almost “spacey” morning mood, but it is not manipulated! 


left is thus only slightly better, despite the 10-megapixel camera. A stronger 
zoom would already provide more details in a 3-megapixel photo. Optical 
zooming increases the resolution quadratically more than an increase of the 
number of pixels. Assume we had the same number of pixels with the same 
ratio of side lengths, then the right image would provide more than twice as 
much (1.5? = 2.25) information. 


® Remark: The left image is — however this comparison may end — better than the 
image in the middle, since there is no drop in brightness at the edges (this is called 
“vignetting” and it is primarily driven by the teleconverter. In the center image of 
Fig. 3.7, the vignetting was amplified by software as it was not clearly visible in the 
original). Of course, the right image is the more interesting one because it represents 
an interesting mood, almost resembling a rocket launch. ® da 
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3.2 Similarity of spatial objects 


In space, the analogous and important result on scales reads: 


Let an object and a similar copy of it be given. Corresponding measures L in 
lengths may be in a ratio of 1: k (scaling factor k), then corresponding areas 
(surfaces) are in a ratio 1: k? and corresponding volumes are in ratio 1: k®: 


Lytigetek, Si2S=1ik, Vivek. (3.1) 


Proof: 1. Each surface can be triangulated (Fig. 3.8). Each triangle’s area on the 
surface changes with the square of the scaling factor. So this rate also applies to 
the sum of all triangular faces, i.e. going into more detail by getting closer to the 
surface. 


Fig. 3.8 surfaces ... Fig. 3.9... and volumes 


2. Each volume can be approximated in an arbitrarily accurate way by using vozels 
(“voxels” — short for “volume elements” — analogous to how pixels refer to “picture 
elements”). Consider Fig. 3.9, which shows a Lion family built from Lego bricks at 
the Expo 2000 in Hannover. For each given cuboid, the volume increases by the 
cube of the scaling factor. So the rate applies to all bodies. © 


>>> Application: gold masks 

The only non-plundered Egyptian grave chamber is that belonging to Tu- 
tanchamun, who lived around 1300 BC and 1,300 years after the time of the 
Great Pyramids. The death masks were placed over each other using onion 
skins and we see they are quite similar to each other. They were made of 
hammered gold and decorated with semi-precious stones, giving the mask a 
50% larger diameter, 1.5? = 2.25 times more than the surface area. << 


>>> Application: construction of the pyramids of Giza (Fig. 3.11) 
What percentage of the mass of the Great Pyramid of Giza (pyramid of 
Chephren, Menkaure) was installed at the time when the pyramid reached 
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one-third (one-half, three-fourths) of its final height’? What percentage of the 
surface was done at this time? 


2 aes eae 


Fig. 3.11 the pyramids of Giza at the outskirts of Cairo 


Solution: 

The following statement applies to each of the pyramids: 

We expect the missing upper part of the pyramid to be similar to the totally 
built pyramid. This residual pyramid had 2/3 (1/2, 1/4) of the final height. 
The missing mass was, therefore, (2/3)° = 8/27 ((1/2)? = 1/8, (1/4)? = 1/64) 
of the total mass. Thus, 1 - 8/27 = 19/27 (1-1/8 = 7/8,1 — 1/64 = 63/64) of 
the mass has already been installed, or about 70% (87.5%, 98.4%). For the 
surfaces, the square of the scaling factor matters. Accordingly, for the time in 
question, there is already 1—(2/3)? = 5/9 (1-(1/2)? = 3/4, 1- (1/4)? = 15/16) 
of the surface finished, this is about 56% (75%, 94%). 


® Remark: The surface of the great pyramids was originally smoothly polished and 
reflected sunlight. Thus, the pyramids offered a completely different impression 
than today. Only about 500 years ago, the precious surface material was almost 
completely removed and used for the construction of buildings in Cairo. Only the 
uppermost part of Khafre’s Pyramid is still reasonably intact. ® daa 
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>>> Application: weight comparison 

A man of 1.60m height has a mass of 50kg. What is the mass of another 
man who is 2m tall and has a similar figure? 

Because of the same density, the masses are in ratio M, : M2 and so are the 
volumes Vj : Va, 


k=2:16=1.25 > x25 Myx2-M, = 100kg. 


In practice, however, larger people often have different proportions. In fact, 
they tend to have muscles of a relatively smaller gauge.! << 


>>> Application: the surface of the Moon and Mars 

The Moon has a fourth of the diameter of the Earth, and therefore, a six- 
teenth of the surface, i.e. approximately 31 million square kilometers. This 
is almost exactly the area of Africa. Mars has a diameter that amounts to a 
little more than half the diameter of the Earth (0.532). Thus, the surface is 
little more than a quarter of the Earth’s surface (0.532? = 0.283), which is 
the area of all continents together. <<< 


>>> Application: subjective size of the Moon 

The distance of the Moon from the Earth varies between 356,410km and 
406,760 km. How many times greater in these extreme cases do the follo- 
wing appear: a) the surface of the crescent, b) the supposed volume c) the 
brightness of the Moon? 


Fig. 3.12 subjective Moon size at varying distances 


Solution: 

The given circumstances relate the respective centers of the Earth and the 
Moon. In fact, we have to subtract the Earth’s radius R = 6,370km, though 
the result is, of course, hardly affected (Fig. 3.12). We have 


1:k = (356,410km - R) : (406, 760km- R) x 1:1.144 
=> Ay: Ag=1:1.381, Vi: Vo=1:1.50. 
The area of Moon’s disc in the firmament fluctuates by almost a third, and 
with it, the radiation strength of the Earth fluctuates through the Moon as 


well. The Moon’s volume subjectively varies by as much as 50%. However, 
these extreme values are only achieved at intervals of several months. <«<« 


1The exact mathematical result, therefore, does not necessarily coincide with the various “BMI 
tables” (body mass index), which can be found on the Internet. 
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>>» Application: diver’s fears (Fig. 3.13) 

Viewed through a diving mask, submerged objects appear 4 times their ac- 
tual size (based on the length scale). So, a 1.3m long reef shark appears to 
have a length of 1.7m, but with an alleged mass increasing by a factor of 
(3)° ~ 2.4 (240%). This may be seen as a possible contributing reason behind 
reported sightings of gigantic sharks. << 


Fig. 3.13 Diver’s fears: The author faces a 2.6 m dusky shark (Carcharhinus obscu- 
rus), photo from above: Sean Hedger. 


>>> Application: diver’s reality 

The greatest danger to divers is not sharks. It is the so-called decompression 
sickness (‘caisson disease” or “the bends”). It occurs when a diver stays at a 
great depth for a long time and then emerges too fast. Due to the increased 
external pressure, more nitrogen is accumulated in the blood and in the 
tissues, while the oxygen in the respiratory air is exhausted. During ascent, 
the reduction of external pressure (Henry’s law) causes nitrogen bubbles to 
form in the blood and tissues. If they are not “exhaled” in time by a very 
slow ascent, they expand and are “caught” in the joints and tissues. This 
initially results in extreme strong joint pain (“bends”) and can be fatal if the 
diver is not brought up in time for recompression in a hyperbaric chamber 
(decompression chamber). 

Numerical example: For each 10m depth increase in water, the external pressure 
increases by 1 bar. In a water depth of 30m, there is a pressure of 1 bar+3 bar = 4 bar. 
The diver now makes a rapid ascent (> 20m per minute), for example because the 
diver’s air supply has run out. The resulting rise in nitrogen bubbles are then not 
fully exhaled and they increase in volume due to the pressure reduction. The surface 
volume is now a quarter of the external pressure; hence, it has (4bar > 1 bar) 
quadrupled. The radius — and thus the diameter — of each bubble has increased by 
a factor of V/4 # 1.6. Diagnosis: The “bends” get worse the longer the diver has been 
exposed to high pressure. daa 
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3.3 On small scales not as on large scales 


In this section, we will see that bodies with increasing size must exhibit 
differences even though they share certain external properties. Supposed si- 
milarities often lead to fallacies. 

Let us first consider the following example: 

>>> Application: optimizing the size of catalyst elements 

Which scale factor k has to be applied to a catalyst object (Fig. 3.15: catalyst 
elements for contact lens storage) so that the active surface doubles? How 
many times heavier is the enlarged object? 


Solution: 


S1:$9=1:2=1:P sk=V2 
=> Vi: Vo = My: My =1:k? =1:(V2)3 = 1: 2.83. 


Thus, the volume increases by almost three times. So, two identical catalyst 
objects have a greater surface area than a similar one with twice the mass. 


Fig. 3.14 surface enlargement through fanning 


® Remark: In order to increase the surface area of a catalyst element without in- 
creasing the mass, the element is fanned out (Fig. 3.15). Nature benefits from this: 
The antennae of male butterflies (Fig. 3.14, right) are fanned out, and the surface 
of the olfactory organ is extended. ® daa 


In general, the following important theorem is valid: 


When you magnify an object, the volume increases faster than the surface. 
Specifically: The ratio V : S = Volume: Surface is proportional to the scaling 
factor k. 


Proof: The volume increases by the factor k?, and the surface increases by the 
factor k?. Therefore, the quotient V :.S increases by the factor k? :k? =k. © 
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Fig. 3.15 catalyst elements Fig. 3.16 basic building blocks 


Examples (Fig. 3.16): 


1. A cube (edge length k-a) has volume (k- a)? and surface area 6(k-a)?, 


Vieoa 
hich yield == -—k. 
which yie 85 : 


2. A regular tetrahedron (4 equilateral triangles with side length k-a and area 
V a 
Ajh 1 1, /2 ka-A and surf: 4A, which gi — =——k. 
) has volume 3 V2 a-A and surface area 4A, which gives > aT 
3. A sphere (radius kr) has volume an (k r)? and surface area 4n(k rr)”, which 


Vor 
Its in — = =k. 
results in e°3 


4. A cylinder of revolution (radius a, height 6) has volume k? za? -b and 


V b 
surface area k?(27a-b+2n a7), which implies a 2/ - +b) i. 


5. A torus (major radius a, minor radius b, b < a) has volume k? 27a: 7b? 


b 
and surface area k? 27a-2mb, which yields ri k. 


Fig. 3.17 guttation drops as minimal surfaces 


® Remark: Of all the possible bodies of the same volume, the sphere delivers the 
maximum ratio y. The sphere belongs to the so-called minimal surfaces, which 
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differ from slightly deviating surfaces in having less (minimal) surface area (Fig. 
3.17). ® 


»»> Application: floating needles (Fig. 3.18) 

It is well-known that a water strider is supported by the surface tension 
of water. One can hardly imagine that this is also true for a steel needle. 
Although steel has almost eight times the density of water (g = 7.8g/cm?), a 
sufficiently small steel needle — if it is very carefully placed on the surface of 
water — “swims”. Find the reason for this. How much bigger /heavier could an 
aluminium needle be (9 = 2.7g/cm?)? Is a gold needle (9 = 19.4g/cm*) able 
to float? 


Fig. 3.18 floating steel needle, water striders, spider (not a water spider!) 


Solution: 

The smaller the steel needle, the greater its surface area relative to its volume 
or weight. From a certain critical length (depending on the needle’s shape), 
the needle’s surface, and thus the surface tension of the water, is large enough 
to carry the steel needle. 

A similarly shaped aluminium needle can be k = 7.86/2.7 ~ 3 times as long. 
The volume is then k? times as large, but the weight is only k? times as large, 
and thus, the surface has increased as well. 

Without much ado, the needle can be made from pure gold if it is k = 
7.86/19.4 » 1/2.5 shorter. ac 


>>> Application: less heat loss through a larger body (Fig. 3.20). 
Explain why large whales are much more likely to live in Arctic waters than 
small whales (lengths: dolphin ...2m, blue whale ...20m). 


Solution: 
Since the body shapes are fairly similar, the ratio - of the whale is & = 10 
times as large as the dolphin’s ratio. The surface of the blue whale can be 
kept warm by its circulation of blood. Therefore, it is 10 times easier to keep 
it warm. 


® Remark: The Arctic waters have lots of oxygen, and therefore, they are more 
nutritious. The largest whales are, thus, more likely to be found there. They only 
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Fig. 3.19 Arctic fox and desert fox Fig. 3.20 blue whale and dolphin (larger bo- 
(ear size) dies have comparatively less surface) 


swim in warmer waters so that their newborn calves do not freeze in the colder 
waters. ® 


® Remark: African elephants have cooling problems because of their body size. That 
is why they have big ears (surface area and of course “fan function”). 

The “cold-blooded” great white shark, which is constantly in motion (Application 
p. 217), loses so little warmth to the outside that its body temperature can be up 
to ten degrees more than the water temperature. ® 

@ Remark: Arctic foxes and desert foxes almost have the same shape. It is striking 
that desert foxes have much larger ears (Fig. 3.19). These are used for listening to 
the often almost noiseless and odorless preys (scorpions, etc.), as well as for cooling. 
Naturally, the Arctic fox has to forgo both. ® da 


Fig. 3.21 The swan is one of the largest flying birds (up to 14 kg mass). 


>>> Application: weight limit for volant animals 
Why do volant animals (especially birds) have a weight limit (10 - 15kg)? 


Solution: 

Buoyancy while flying is related to the wing-span (“tearing edge”) and wing 
surface. The change of both will be slower compared to the change of the vo- 
lume when increasing the length scale. The larger the bird, the more difficult 
it is for it to take off. Large raptors launch themselves from rocks to reach 
the initial velocity required. The non-volant ostrich (50- 100kg) would need 
a start-up speed of several hundred kilometers per hour in order to take off. 
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Fig. 3.22 bee-eaters, wasp with prey at departure, a stowaway (1mm) 


® Remark: Pteranodon (the biggest flying dinosaur) probably had a wingspan of 
about seven meters, but only a mass of approximately 15kg. It would never have 
been able — like in Jurassic Park III — to drag an object of human size through the 
air. A similar performance can only be achieved by much smaller creatures, like the 
wasp on the right of Fig. 3.22, which has paralyzed a spider and is transporting the 
spider in flight. <<< 


>>> Application: cooling of the planet 

After a huge explosion in space, two planets are produced whose diameter 
have a ratio of 1:2. Which planet will cool faster? 

The volumes have a ratio of 1: 8; the surfaces have a ratio of only 1: 4. 
That is, the larger planet has half as much surface area in proportion to its 
volume, and therefore, cools down more slowly. 


Fig. 3.23 odd cocktail “on the rocks” and ice transportation at 35° in the shade 


® Remark: Our Moon is probably 4.5 billion years old? and is believed to have 
merged from 20 smaller “moonlets” that spalled off following an asteroid strike on 
the Earth. The fact that the density of the Moon is less than that of the Earth 
may be taken as an indication supporting this theory. The larger density of the 
Earth is mainly due to the nickel-iron core. The Moon, with a diameter of only 
about a quarter of the Earth’s diameter, has, therefore, already cooled by the above 
considerations, while the Earth is still liquid inside (the frozen continents float like 


2In National Geographic, issue September 2001, there is a clear explanation for the relatively 
precise dating of the formation of the Earth. 
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a milk skin on the surface of coffee). Due to their own gravity, all comets or moons 
with a diameter of more than 500km attain a spherical shape as long as they are 
liquid inside. For example, the two quite small moons of Mars are not spherical. 

What applies to cooling generally also applies to temperature compensation. Fig. 
3.23 illustrates how much the size of a frozen block influences its melting time. 
Towards the end of the austral winter, the giant icebergs on the sea ice of the South 
Atlantic cover an area which is twice as large as Canada. The enormous amounts 
of ice have to be thawed only once (Application p. 226)! ® <<< 


The fact that during enlargement or reduction, the ratio Volume: Surface 
changes, had an enormous impact on the respective development of insects 
and warm-blooded animals: 


& 
Fig. 3.24 an elongated and a “rotund” insect 


>>> Application: maximum size of insects (Fig. 3.24) 

In general, insects are shorter than 10-15cm (the heaviest, though not the 
largest, insect is the up to 12cm-long Goliath beetle, see Fig. 3.24, right). 
Their oxygen supply is the so-called trachea, which is a system of tubes in 
their exoskeleton. The oxygen exchange takes place via the surface of this 
system. For a length over 15cm, this oxygenation is no longer enough: There 
is a mismatch in the ratio volume (weight) : tracheal surface which is getting 
too large. The ideal size of an insect is clearly in the range of 1 cm or less. 
Smaller insects are much more resistant than bigger ones. This is also due to 
the deteriorating ratio between body weight and muscle strength (compare 
Application p. 85). 

@ Remark: As always, exceptions prove the rule: There is an up to 30cm-long stick 
insect (Palophus titan) (Fig. 3.24, left), and the giant dragonflies in carbon rocks 
have a wing-span of up to 75cm! For an extremely elongated body shape, only a 
very short trachea is required, and such a trachea may then have a relatively large 
diameter. ® << 


>>> Application: surface enlargement for a slenderly built body 
The following table is intended to show how much the surface increases for a 
cuboid — starting from a cube — with a consistent volume of 1m® (and thus 
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a constant mass) which is getting slimmer. The same also applies for more 
complex shapes. The dragonfly in Fig. 3.25 needs a small cross-section in 
order to achieve high flight speeds. Surprisingly enough, the beetle can still 
fly pretty well! 


type | proportions | measures [m] surface factor | increase 
TO pas ei 6.0/m 

Tl 1it:2 0.79? x 1.59 6.3/m 5% 
T2 1:1:4 0.637 x 2.52 7.1/m 19% 
T3 1:1:8 0.5? x 4 8.5/m 42% 
T4 1eis16 0.4? x 6.35 10.4/m 73% 
T5 1:1:32 0.31? x 10.08 12.9/m 115% 
T6 1:1:64 0.257 « 16 16.1/m 169% 


Fig. 3.25 various degrees of thinness with the same mass 


>>> Application: optimal oxygen supply (Fig. 3.26) 

In contrast to insects, the bodies of higher developed animals are supplied 
with oxygen via their blood (Fig. 3.26). More blood allows more oxygen to 
be transported. The amount of blood increases in proportion to the volume 
and there is no mismatch. The oxygenation by blood thus limits the size of 
the living being but it is not the only factor — there are other factors at play 
as well (see Application p. 85). 


Fig. 3.26 veins and trachea Fig. 3.27 mammals 
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® Remark: The only problem is the accumulation of blood in the lungs: The surface 
of the lungs has to increase disproportionately, which is made possible by a rapid 
increase in the number of alveoli in large animals. Infants initially have fewer alveoli, 
and they disproportionately increase their number until the age of eight, to be ready 
by “maturity”. ® <<< 


>>> Application: minimum size of warm-blooded animals (Fig. 3.27) 

For animals with constant body temperature, there is now a clear lower 
limit for the height, namely, about the size of a pygmy shrew (Fig. 3.27) 
in mammals and the bee hummingbird (a hummingbird species with only 
2.5g of body weight) in birds. For both, there is an extremely small ratio of 
weight (and thus warming amount of blood) to surface area over which these 
animals constantly lose heat (energy). To make up for this loss, the shrew has 
to eat constantly. Smaller shrews from earlier geological periods, therefore, 
subsisted partially on nectar, extremely high-energy food. The same holds 
true for the hummingbird. <<< 


For similar reasons, one can explain why large animals in relation to their 
mass are much weaker than smaller animals: The strength of a muscle is not 
dependent on its volume but on the cross-section: 


Pease 


x > ah ~ ow 


Fig. 3.28 an ant and an elephant at work 


>>» Application: relative body strength (Fig. 3.28, Fig. 3.29) 

An ant can carry many times its weight. Although an elephant is objectively 
incomparably stronger, it can only carry much smaller loads relative to its 
weight. In addition, an elephant already needs very thick legs (due to a 
disproportional cross-sectional enlargement of the muscles), in order to bear 
the enormous weight. 


® Remark: This is even true when comparing relative strengths of small and large 
articulate animals: 5 mm long jump spiders can jump much further (in relation to 
their size) than the quite tremendous (and 10,000 times heavier) tarantulas. @ 

Numerical ecample: Pygmy Shrew: 4.3 - 6.6cm (without the tail) with 2.5 - 7.5, 
elephant: up to 3.5m and 4,000kg. If the elephant were as slim as the shrew, it 
would weigh less than 2, 000kg. <<< 
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Fig. 3.29 Left: It took minutes to overtake the millipede — the short legs of the 


victim are strong levers. Right: An ant tries to escape a meat-eating sticky plant 
(sundew, Drosera capensis). 


Fig. 3.30 One cannot get much smaller. Left: Escaped after being trapped in a cat’s mouth! 


If the ratio V2 : V, of the volumes is given, then the similarity factor is 
calculated by taking the cube root: 


k = V/V/V. (3.2) 


Then, one can again use the formulas (3.1). 


>>> Application: allometries in the animal kingdom 

Biologists are understandably cautious when using the term “similarity” and 
instead use the term “allometry” in this context, when certain organs are not 
in the same proportion as most others. Nevertheless, a domestic cat (5kg) 
and a leopard (50kg) are not too allometric in spite of their similar shape 
(Fig. 3.31). How do their shoulder heights respectively relate to their coat 
surface or their tread surface (i.e. the surface of their paw prints) ? 


Solution: 
Since their bodies have the same density, the ratio of the masses equals that 
of the volumes. 


kh? =10>k= VY10=2.15, k? 24.6. 


The shoulder height has a ratio of 1: 2.15. The coat surfaces or the treading 
surfaces have a ratio of 1: 4.6. <<< 
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>>> Application: similarity between a pup and a dam 

Relatively little allometry can be seen in the two rhinos in Fig. 3.32 and the 
tigers in Fig. 3.33. In the middle image of Fig. 3.33, it can be seen only at 
second glance that a “baby tiger” slurps water. The difference in size to the 
mother can only be estimated from the image on the right. How can the 
masses of the young animals be guessed? 


Solution: 

Fig. 3.32: Since the rhinos in both images are standing in parallel, length 
measurements (for example, the length of the spine) can be compared quite 
well. The scaling factor is about k » 0.75. The mass of the baby animal is 
k® of the mother’s mass, so at a good 40%. Since adult rhinos have a mass 
of 1,500 - 2,000 kg it is likely that the cub would weigh around 600-650 kg. 
On the right, the shoulder height of the baby animal may be k = 0.4 of his 
mother’s shoulder height, resulting in a mass of about k? = 0.4? = 0.064 
1/16, which equals about 100kg. 

Fig. 3.33: Here, the length dimensions are not directly comparable. If the 
shoulder height of the young tiger were 45% of the shoulder height, the 
masses would be in the ratio 0.452: 1° * 1: 10. Siberian tigers are the largest 
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Fig. 3.33 similar Siberian tigers (low allometry) 


living carnivores. Even the females have a mass of 100 - 170kg. So, young 
tigers must have a mass of about 15kg. 


Fig. 3.34 two ants of the same species (worker, guardian) 


® Remark: The ants in Fig. 3.34 are directly comparable. The larger guardian ant 
is about 1.5 times as long as the worker ant, and in absolute terms it is 1.5? = 2.25 
times stronger. Relatively speaking, it has a mass that is 1.5? » 3.4 times as large 
as the mass of the worker ants. It is, thus, weaker than the smaller ant. ® <<< 


>>> Application: wild comparisons 
Compare the animals in Fig. 3.35 and Fig. 3.36 and consider the differences 
in the functioning of their various organs. 


Solution: 


Fig. 3.35: The whirligig beetle is a predator at the water surface with a 
rigid and extremely resistant exoskeleton. It has compound eyes and floats 
— “translating” its body length to that of a human — at a rate of 500km/h 
with 60 beats per second. Like almost all insects, the beetle can even fly. The 
seal is warm-blooded and has ordinary eyes like humans (only with a specific 
focal length for underwater vision, see Application p. 52). It swims through 
serpentine body motion (flexible inner skeleton)! 

Fig. 3.36: The flower chafer uses its antennae to smell, chitin hair for pol- 
linating plants, and can fly excellently. Water buffaloes have their horns as 
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a defence mechanism, hair as refrigeration, and safety features — and quite 
obviously, they cannot fly. <<< 


>>» Application: more wild comparisons (Fig. 3.37) 

A giraffe with a height of 6 m can have a mass of up to 1,600 kg. How much 
mass does a 2 m high baby giraffe have? What would one estimate the mass 
of a 12 cm long sea horse to be? 


Solution: 

Young giraffes have comparatively shorter necks. Thus, we should compare 
the calf with a giraffe of height 5 m. With 2/5 of the height, one can estimate 
(2/5)? ~ 1/16 of the mass, i-e., 100 kg. 

Now the comparison with the sea horse. Here we really just get a rough 
estimation. The length proportion is 1 : 50, thus the volume (and mass) 
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Fig. 3.37 A geometrical resemblance is certainly there — including the mane or the 
dorsal fin — but that is about it ... 


comparison should be 1 : 50° > 1, 600/125, 000 kg or 13 grams. (Sea horses 
have masses between | g and 1 kg, depending on the species.) 


® Remark: As a species, we rely disproportionately on our eyesight — a predilection 
that makes us prone to judging by appearance. Sea horses and giraffes may be 
extremely different animals, and yet, one would be hard-pressed to deny a superficial 
resemblance beyond the mere difference in size.? Some even conclude, on the basis of 
appearance alone, that humans have fish-like ancestors — but looks may be deceptive, 
and our naive intuitions are often mistaken. Incidentally, evolutionary biology has 
already settled the question of what caused the giraffe’s famous neck to grow so 
long: it did not evolve, as it is often believed, because the animals fed on trees 
that grew their foliage on increasingly higher branches in a desperate arms race to 
survive in the African savanna. Rather, their necks seem to have been molded by 
the ubiquitous mechanism of sexual selection. ® <<< 


3G. Glaeser, H. Paulus: The Evolution of The Eye, Springer Nature, 2015. 
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3.4 Centrifugal and gravitational forces 


In this section, we want to derive some rather interesting findings about astro- 
nomy and space travel by means of proportionalities. Newton had already 
realized that there is a mutual force of attraction between two bodies that 
increases linearly with respect to their masses, but decreases with respect to 
the square of the distance. Moreover, we know that the centrifugal force that 
acts on a rotating body increases quadratically with its track speed while 
the distance to the center of rotation enters the formula for the attraction 
force only linearly. The entire universe is based on the interplay between 
attraction and centrifugal force: Planets, moons, and satellites pursue their 
course through space and arrive precisely to the second at pre-calculatable 
positions. 

® Remark: The gravitational force is the decisive force only “on a large scale”. This 
is because the force is a product of mass and acceleration. The smaller the ob- 
jects are, the smaller is the influence of the gravitational force. Insects can already 
blithely crawl on vertical or even overhanging walls against gravitational forces. 
Mini-mosquitoes or mini-spiders can be airborne in the wind for kilometers. 

In the molecular region, there are very different forces that are of significance. There 
are enormous forces that hold molecules or even just atomic nuclei together, these 
dominant forces are at the nanoscale. Their effect ends abruptly “at the edge” of 
the molecule or atomic nucleus. @ 


>>> Application: astronomical calculations 
Given the following data: 


diameter [km] ] distance to Earth’s center [km] | density [kg/dm?| 
Earth E 12,740 6, 370 5.516 
Sun § 1,390, 000 149, 500, 000 1.409 
Moon M 3,470 384, 000 3.341 


How do the volumes, the masses, the forces of attraction act (relative to the 
Earth’s surface)? 

We relativize these absolute values: The lengths are given in terms of the 
Earth’s radius. Density, volume V, and mass M of the Moon and the Sun are 
given as multiples of the respective measures of the Earth, and the attraction 
of the Sun and the Moon are in relation to the gravity of the Earth to a certain 
point on the Earth’s surface: 


radius | distance d | density o V | M=oV | F=M/ a 
E 1 1 1 Si i 
S 109 23, 500 0.26 | 109? = 1.3-10° | 332,000 | 0.0006 
M 0.27 60 0.61 0.27? = 0.02 0.012 | 0.000003 
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An astonishing result concerning the force of attraction: The Sun has about 
200 times the pull of the moon! Nevertheless, the tide of the Earth is de- 
termined by the Moon, rather than the Sun! The reason for this is that the 
attraction by the Sun is largely “eliminated” by the centrifugal force of the 
Earth’s orbit around the Sun. <a 


>>> Application: the gravitation on the surface of a celestial body 

On Earth, we have a gravitational acceleration of 1 g. What is the acceleration 
on another planet or on the Sun with & times the diameter and d times the 
density? 

Solution: 

The mass of the body increases with dk. Gravitation decreases at the same 
time with the square of the distance from the center (factor 1/k?). Altogether, 
the increase is, thus, dk. 

Since Mars, for instance, has a lower density than the Earth (d * 0.7) and 
its diameter is about half (k = 0.532), we can expect little less than 40% of 
the gravitational force on its surface. On our Moon (d * 0.8, k = 0.25), we 
have 20%. For both Jupiter (& = 10) and the Sun (k& = 100), we have d# 1/4. 
Thus, we can expect 2.5 g or * 25 g, respectively. <<< 


>>> Application: track speed of a satellite 

Satellites are orbiting the Earth at altitudes of 150km. To be in “equilibrium 
of forces”, the force of gravity (weight G) and the centrifugal force F’ must 
“cancel” each other out. Now, we want to derive a relation between linear 
speed, orbital period, and altitude. 

Let R be the radius of the Earth (R = 6,370km), and r the distance between 
a satellite (mass m) and the center of the Earth. With respect to the Earth’s 
surface, the satellite has weight Gr = mg. The attraction decreases with the 
square of the distance from the Earth’s midpoint (Newton). At a distance 


2 
r =kR, the attraction is thus reduced to the weight G, = Gr: (2) = oR, 


For the centrifugal force, we apply the well-known formula F = mv?/r. With 
F =G,, the airspeed at a distance r equals 


=mg(—) sy? g=R an > kv g R= constant. 


So, we have 


: k k 
kv = 9.81-6.37- 10° = vp Vk ® 7,905— = 7.90 — = 28, 460—~ 
S S S 


or 
_ 7.90 km _ 28,460 km 


Vk OS aie Te” 


(3.3) 


Ur 
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The factor k is calculated from the altitude H by 
r R+H 


R R 


The circumference of the Earth is Up = 40,000km. With increasing height, 
the circumference — in this case, the trajectory of the journey — increases 
linearly in k: 


U, =kUR. 


Thus, the so-called “sidereal period of rotation” T (relative to the fixed stars) 
equals 

U, _ k- 40,000 km 
7.9 km 

Vk Ss 


The results will be compiled in a clearly arranged table. 


T= 


wk? 5060s ~ k8/?-1.41h. (3.4) 
Ur 


H [kn] 150 | 500] 2,000 | R—6,370 | 31,850 | 377,000 
k=(R+H)/R | 1.024 [ 1078 | 1.314 2 6 60.3 
v, [km/b] 28,140 | 27,300 | 24,830 | 20,120 [11,620 | 3,665 
period [hl 146| 157| 2.12 3.99 | 20.7 660 


We recognize that the greater the altitude, the lower the airspeed will be 
and the longer the orbital period will be as well. If we compare two satellite 
orbits (ky; and kz), then, by Formula (3.4), we have 


Ty: Ty = ke? eS? = 72: 1? = 8 kB = 73 er. 


Thus, we have verified for a circular orbit: 


The squares of the revolution periods are proportional to the cubes of the radii 
(Kepler's Third Law). 


>>> Application: sidereal and synodic orbital period of the Moon 

The last column of the table above relates to the orbit of the Moon, which can 
also be seen as a satellite of the Earth with an average distance of 384, 400 km. 
You have to remember that the common center of gravity of the Moon and 
the Earth is not the center of the Earth, so that you should rely only on a few 
decimals. The Moon takes about 660h * 27.3d in sidereal time to complete 
an orbital period around the Earth. The period from a new moon to the next 
(a “synodic month”) takes longer because the Moon must also compensate 
the Earth’s rotation during this rather long time. It sums up to * 29.5 days. 
® Remark: The Moon increases its distance annually by a few centimetres, thus it is 
getting slower. Millions of years ago, the Moon appeared bigger in the Firmament 
and solar eclipses were more pronounced. ® <a 
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»»» Application: relative position of a satellite? (Fig. 3.38) 

When plotting the position of a satellite on a global map, the result becomes 
a wavy trajectory whose shape depends on the imaging method used for the 
map (yet, in no case, would it look like a sine curve, as one might suspect). 
The trajectory leaves the map somewhere on an edge and enters on the 
opposite edge at an appropriate place. This is, of course, due to the fact that 
the world map must be “cut to pieces” somewhere. 

Until now, we have not taken into account that during the orbital period of 
the satellite, the Earth rotates. If the satellite rotates above the equator in 
the same direction as the Earth, then it cannot be seen from an observation 
station B after precisely one rotation, because B has already turned further 
by this time (per hour 360°/24 = 15°). Therefore, the satellite may require 
additional time. Consequently, the trajectory of the satellite is “out of phase” 
with the next orbit. 


® Remark: To be more precise: The Earth rotates exactly once around its axis 
every 23h56 min = 23.933h (a sidereal day), so the Earth turns 15.04° per hour. 
The remaining 4 min is needed for the Earth to turn back to the same position 
relative to the Sun: Within one day, the Earth has made * 1° of its orbit about the 
Sun. (This value varies slightly in the course of the calendar year.) ® 

Now, if we place Fig. 3.38 “under the microscope’, we see that the parallel 
shift of the wave train is in increments of 23.1° longitude. This corresponds 
to 1.54h of the orbital period. Using Formula (3.4), k is calculated and the 
flying height H is consequently deduced as well 


3/2 1 Ath = 1.54h > k = 1.058 > H = R(k-1) » 373km. 


Furthermore, the track speed equals v, * 27,600 km/h according to Formu- 
la (3.3). The values of H or v,; closely match those instantaneous values given 
in Fig. 3.38 (in fact, these values vary slightly). <<< 


>>» Application: geostationary satellites (Fig. 3.39) 

In Application p. 92, we saw that high-flying satellites fly more slowly in order 
to overcome the already small gravitational force by means of centrifugal 
force. When a satellite flying above the equator (in the same direction as 
the globe’s rotation) has an orbital period of exactly 23.93h (see above), 
it will remain above the same spot on the equator. In fact, there are now 
almost 300 (!) satellites “positioned” above the equator: weather satellites 
and especially communication satellites (mostly TV-satellites). This means 
that one satellite follows the next at an approximate distance of only one 
degree of the central angle. For such a “geostationary” satellite, we derive the 
following condition with Formula (3.4): 


4See http: //spacef light .nasa.gov/realdata/tracking/index.html — with kind permission of 
NASA. 
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Fig. 3.38 In a rectangular map, the clas- Fig. 3.39 More than 300 geostationary 
sical trajectory resembles a collection of satellites and only one orbit for them ma- 
repeatedly shifted sinusoids. kes it a little bit crowded up there ... 


T = k3/? 1.41h = 23.93h = k = 16.977/3 ~ 6.60. (3:5) 


This corresponds to a distance of 6.60: R from the center of the Earth or an 
altitude of 5.60-R  35,700km. The associated track speed is, in accordance 
with Formula (3.3), still 11,000 km/h. The word “positioned” must, therefore, 
be seen as applying only relatively! 

A geostationary satellite has to fly above the equator: Each satellite’s orbit 
does indeed have the center of the Earth as its center. However, a “fixed 
point” on a non-available equator point rotates in a plane which does not 
contain the center of the Earth.® <<< 


>>> Application: mental maths: the Moon vs. geostationary satellites 
The well-known orbit of the Moon (orbital period » 27 days) (Application 
p. 93) allows us to make a quick estimation of the distance of the geostatio- 
nary satellites (orbital period 1 day). 


Solution: 


. 3: : 2 ; = . Q)2 gral 
(Pesestat 7 TMoon ) = (1 . 27) = Teeostat * "Moon = el ‘ 3) = Teeostat © 9 °/Moon: 
ddd 


>>> Application: airborne weight loss 

If we were to fly a plane at a speed of 28,460 km/h (altitude would be very 
small, speed sidereal and not relative to the ground), we would — like the 
astronauts in the space station — become weightless. 

Of course, this is not possible because our plane would burn up in the dense 
atmosphere. Therefore, we suppose that we fly at cruising speed, that is, at 
about 830 km/h “ground speed”. If we do this above the equator “in line with 
the Earth’s rotation” (i.e. flying to the east), we arrive, together with the 


OCA 1,670, at a sidereal 


speed of a point above the equator of ohh 


5 An interesting website is http://www.sat.dundee.ac.uk, where you can find new daily recor- 
dings from geostationary satellites from all over the world. 
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speed of 2,500km/h. This corresponds to about 1/12 of 28,460 km/h. The 
centrifugal force increases or decreases at the same radius with the square 
of the speed. Therefore, the force on the plane equals 1/12? = 1/144 of the 
force that would annul our weight. Our weight in the aircraft is, thus, almost 
one per cent lower than in the hibernation state at the North Pole (sidereal 
velocity 0). With a mass of 70kg (weight force approximately 700N), this is 
about 5N. 


® Remark: Leonardo da Vinci (1452-1519) still believed that the weight of a body 
is dependent on its speed. As “proof”, he argued that a horse with a rider moving in 
full gallop can stand shortly on one leg. His theory was refuted a century later by 
Galileo Galilei (1564-1642). After all: Leonardo’s daring theory contained a (very 
small) grain of truth if we stick with the example of the riding horse: It is at least 
true that the rider is lighter when he rides toward the east ... ® <<< 


>>> Application: when the centrifugal force is dominant 

Fig. 3.40 shows the rapid rotation of a human performing multiple somer- 
saults. Three revolutions per second with a radius of perhaps 30 centimetres 
is very fast for us. After all, the centrifugal force is then already so strong 
that one’s hair almost exclusively obeys the centrifugal acceleration a. Find 
out what the size of a is. 


Fig. 3.40 The centrifugal force can easily outdo other forces. 


Solution: 
We use the well-known formula a = rw?, where r is the radius in meters and 
w is the angular velocity, and get 


a = 0.3m: (6r/s)? » 120 m/s? » 12g. 


Humans can stand 12-fold gravity only for a short time! 

Twice as many rotations per second produce four times the acceleration, 
which is more than a person could cope with: The person will at least: have 
internal bleeding due to blood emerging from the veins. 

A technique in which this is utilized: Dryers that tumble with 900 revolutions 
per minute (or 15 per second) and have a drum radius of 30 cm produce 
accelerations of 300g, extracting every drop of water from the laundry. <<< 
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>>> Application: The king’s chamber in the Great Pyramid (Fig. 3.41) 
The Cheops pyramid was originally 146m high. The height of the King’s 
Chamber was chosen so that the horizontal plane through the sarcophagus 
produces a cross-section of the pyramid whose area is exactly half the surface 
of the pyramid’s base. How high is the horizontal plane? 


Fig. 3.41 the horizontal plane through the King’s Chamber 


Solution: 

Let h = 146m be the total height of the pyramid and A be the area of the 
base. Furthermore, let hz be the height of the horizontal plane above which 
there is a similar pyramid whose base is Aj = A/2. We find 


hg:h= Ag: A= hg = h/V2 * 103.25m. 


Thus, the horizontal plane is at an altitude of hy = h—- hg = 42.75m. 

® Remark: The subsoil of the pyramid — a square whose 230 m long sides are precisely 
aligned with the cardinal directions — was leveled up to a precision of centimetres 
before the beginning of the construction process. This was before the invention of 
the spirit level or the compass ... ® <<< 


>>» Application: misleading diagrams (Fig. 3.42) 
In the “statistics” in Fig. 3.42a, there are three similar barrels, and the ratio 
of their heights and radii can be defined as 


12032 1.3". 


They intend to illustrate that an oil company has increased its production 
in the last two years by 30% (k = 1.3) in every sector. However, the volumes 
of the barrels have ratios of 


121972183") w1s9 924, 
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Fig. 3.42 a) falsifying chart, b) and c) correct charts Fig. 3.43 stock prices: dis- 
tortion in the ordinate sug- 
gests comparable losses 


Thus, the diagram in Fig. 3.42a pretends an increase by 120% each year. The 
diagram in Fig. 3.42b, however, is correct because the radii of the barrels were 
left unchanged. 

Even the information conveyed by Fig. 3.42c is correct, because both the 
radius and height are multiplied by the factor k = </1.3. 

® Remark: Fig. 3.43 shows the price development of the shares of two major banks 


during the real estate crisis in late 2007 (as of 9. 1. 2008). At first glance, it seems 
“caught” between the same two banks. A closer look at the scale shows: The upper 
bank varies between the values 42 and 58 (which is almost 40%, based on the lower 
value); the lower one is between 116 and 94 (which is “only” a little more than half). 
The “fever chart” looks similar, because of the assignment of equal fluctuation ranges 
(in the mathematical sense), but they are stretched along the ordinate. ® <<< 


>>> Application: ostrich and chicken eggs compared (Fig. 3.44, left) 

An ostrich’s egg looks like a large chicken egg, but has about 24-times the 
mass of a chicken egg. What is the ratio of the diameters and what is the ratio 
of the surface areas? Why do ostrich eggs need extreme heat for hatching? 


Fig. 3.44 ostrich and chicken eggs in comparison 


Solution: 


M,: Mz =1:24=>V,:V2=1: 24 (equal consistency) 
= dy: dy =1: Y2461:2.9 => $1: S2=1: (24)? 81:8.3. 


The ratio of the surfaces: The volume of the chicken egg is much smaller 
than that of the ostrich egg (using the factor k = 2.9, see Application p. 80). 
In order to heat the entire interior of the egg, a long exposure to a higher 
outdoor temperature is required. Therefore, ostriches live mainly in the hot 
semi-deserts of southern Africa. 
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® Remark: How long do you have to cook an egg? This question depends not only 
on whether you want to have the egg soft-boiled or hard-boiled, but also on the size 
of the egg. Large eggs take longer! ® <<< 


>>» Application: ground pressure (Fig. 3.45) 

Another comparison between ostriches and chickens: Both birds prefer to 
run, rather than take to the air (the ostrich uses its wings only to maneuver 
at rapid speeds). Which bird has the greater ground pressure? 


Fig. 3.45 tread surfaces or ground pressure 


Solution: 

A statement can be made immediately: The young domestic chickens in Fig. 
3.45 (right) have a lower ground pressure than their mother (see Application 
p. 86): They are smaller, and if they are of similar shape, their tread surface 
is larger in relation to their weight (pressure = force / area). One would 
be tempted to think now that an ostrich weighing 50-100 kilograms would 
produce significantly greater ground pressure. After all, the ostrich might 
weigh twenty or thirty times the weight of an adult domestic fowl. On the 
other hand, the foot of an ostrich has oversized dimensions (Fig. 3.45, left) 
and it is expected to offset maybe twenty to thirty times the tread weight. 
Thus, the ground pressure of both birds is approximately the same. 


® Remark: A bunch of ostriches look like certain dinosaurs (for example, the rap- 
tors), because they are quite similar with regard to their shape. These protozoa 
were probably also able to run quickly at a pace that they could possibly sustain. 
However, the latter property requires the animal to be warm-blooded. © 


® Remark: A man has a relatively large contact surface (Fig. 3.44, right, Fig. 3.46, 
left). Things change once he sits on a bike (Fig. 3.46, top left). ® 


® Remark: Having stated the above, an elephant has a smaller ground pressure than 
a cow. A giraffe weighs a little more than a cow, and has slightly larger hooves. So, 
their ground pressure is comparable. The Anopheles mosquito in Fig. 3.46 on the 
right stands on four legs. However, the ground pressure produced by the mosquito 
given with comparable physique and tread surface would be merely 1/500 of the 
ground pressure produced by a giraffe (1 cm versus 500 cm). At this scale, adhesion 
prevails, and the mosquito can, therefore, dangle from the ceiling. @ <a 
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Fig. 3.46 Left: man, bicycle, ox, horse. Center and right: two extremes 


>>» Application: camera sizes (Fig. 3.47) 

The chip size of a digital camera is a criterion for the image quality. Many 
professional cameras have a “full-frame sensor”. Meanwhile professional ca- 
meras with a “crop-factor” are also being offered on the market. A sensor 
size of 50%, for instance, does not necessarily mean a camera has only half 
the body size. The cameras pictured below have a width of 15.5 cm and 
12.5 cm. What weight difference is to be expected for the bodies and the 
lenses respectively? 


Fig. 3.47 Camera sizes and weights of two professional cameras and two comparable 
lens configurations (left: telezoom up to 300 mm, right: macro lenses). 


Solution: 

The factor is k = 15.5/12.5 » 1.24. If all measurements (including the thick- 
ness of the metal) are scaled equally, this should result in a weight difference 
of k° 1.9. Due to the fact that the larger camera has a comparatively larger 
battery, the weight factor is about 2.2. When it comes to the lenses, the diffe- 
rence should be bigger: A lens with a focal length of f = 300 mm, for instance, 
needs to have only f = 150 mm for the smaller sensor. This would result in 
an extreme weight difference (2° = 8!). Telelenses, however, are “mainly air”, 
and therefore, the weight difference is usually 2 to 3. <<< 
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>>> Application: Tetra Pak in various sizes 

A drink is to be launched on the market in “Tetra-Pak” packaging, respec- 
tively containing 0.25 liters and 0.5 liters. How many times more packaging 
material does a “Tetra-Pak” of 0.5 liters need? By what percentage must the 
edge length of the tetrahedron (Fig. 3.48) or the cuboid (Fig. 3.49) increase? 


Fig. 3.48 Tetra Pak in various ... Fig. 3.49 ...sizes and shapes 


Solution: 


ViiVea1:2 > k= Y2=1.26 > $1: S.=1:k =1: Y4=1.59 


So you need about 59% more packing material for the half-liter drink. The 
edge length increases by 26%. aac 


Fig. 3.50 Droplets distribution. Right: 64 small droplets must unite to create a 
droplet of fourfold diameter. 


>>> Application: spherical dew (Fig. 3.50) 

If left to cool and condense, the morning dew that attaches itself to leaves 
(right) and cobwebs (left) produces wonderful constellations of approxima- 
tely equally sized water droplets. These droplets are spherical due to surface 
tension, which aims to enclose the highest possible volume. Their individual 
sizes are so similar because of cohesiveness, which causes molecules to be at- 
tracted towards “strategic centers” such as equidistant points. The droplets 
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on the cobweb have a radius of about 1-2 mm, just like the smallest droplets 
on the leaf. If 64 droplets join together, they produce a larger drop of fourfold 
diameter (right). <a< 


>>> Application: dissolving a powder in a liquid 

The same quantity of an effervescent powder is to be dissolved in two glasses 
of water. For the first glass, the powder is coarse; for the other glass, the 
particles of the powder are half the diameter. How many times faster will the 
finer powder dissolve than the coarser one? 


Fig. 3.51 dissolving a powder in theory ... Fig. 3.52 ...and in practice 


Solution: 

The radii have a ratio of 2: 1, the volumes are 8: 1, and the surface areas 
are 4:1. In the finer powder there are 8 times as many spherules available, 
each with 7 of the surface of a larger spherule (Fig. 3.51). Overall, all the 
small spherules have, thus, twice the surface area, and the liquid will initially 
dissolve twice as fast. Here, the radii of the large and the small spherules 
changes, and other conditions occur: The small spherules become smaller in 
proportion. The above example is only correct if the radii of the spherules 
do not change. <<< 


>>> Application: dipping a cylinder (Fig. 3.53) 

An arbitrary axially symmetric cylinder (such as in Fig. 3.53 on the left) is 
immersed vertically into water. How deep does it subside? 

Note: The cross-sections are not only similar, but even congruent. The volume 
of a cylinder equals 


volume = cross-section x height. 


Solution: 

Let A be the cross-sectional area, h be the cylinder height, 9 be the density, 
and t be the depth of immersion. Then the mass of the cylinder Mz = 9: A-h 
and the mass of water displaced is My =1-A-t. According to Archimedes’s 
principle, we have 


M,=My => o0-A-h=At>t=h-o. 
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Fig. 3.53 dipping a general cylinder or a cylinder of revolution 


® Remark: If the cylinder is not fixed, it will eventually — depending on its shape 
and the specific weight — even at small disturbances (wave motion) assume a stable 
equilibrium position, such as on the right in Fig. 3.53. We will examine this physical 
process more accurately in Application p. 218. ® <a 


>>> Application: dipping a conical object in water 

A fully axially symmetric cone (density @ < 1) floats 

a) with the top down or 

b) with its base down 

in the water. How deep does it plunge? What percentage of the lateral surface 
is wet? 

Solution: 

Fixing some unit of measurement, let the volume of the cone be V and its 
mass be M = o-V. We apply Archimedes’s principle: The weight of the body 
is equal to the weight of the displaced fluid (water: Qy = 1). Mass and weight 
are proportional (G = M -g); hence, the law also applies to the masses. 
First case a) (Fig. 3.54a): 

The penetration depth is t; units (0 < t; < 1). The volume V,, of the ousted 
water is the volume of a similar cone with V, = V-¢?. Again, according 
to Archimedes’s principle, the displaced weight before and the total weight 
afterwards are equal: 


My=M > V-=0-V => ti= ¥. 
For the lateral surface, we find 


total: moistened = 1:t7 =1: ¥/¢?. 


In Fig. 3.54a, we have @ = 0.5. This results in t; » 0.79 and ¢? ~ 0.63, that is, 
79% of the body’s height is immersed in the water and 63% of the surface 
area is wet. 

Shifting our attention to the case b) (Fig. 3.54b): 

The cone stands h units out of the water. The volume (and hence the mass) of 
the displaced water is then V,, = V(1—h®), and thus, following Archimedes’s 
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Fig. 3.54 immersing a cone in water 


principle, we have 


V(1-h®)=0-V => h=*V/1-0. 


The penetration depth is 


pela pet= 1p. 


For the surface area, we get 


total : moistened = 1: (1 =f) =1:(1- ¥(1-0)?). 


Setting o = 0.5, Fig. 3.54b results in tg * 0.21 and 1: (1—h?) » 1: 0.37. That 
is, 79% of the body height is above the water and only 37% of the surface 
area is wet. Compare these results with the case of a). 


® Remark: Upon immersion of other bodies, such as a ball, the displaced volume 
is not similar to the immersed body, and so you have to use other, more complex 
methods — see Application p. 283. Generally, one must always remember that a 
body tries to adapt a stable equilibrium position during immersion. The cone will 
tilt under certain conditions! This reminds us of an iceberg, which protrudes by 
about one-tenth of its volume out of the water. It is often observed that an iceberg 
begins to rotate unexpectedly in order to take a different, more stable position. ® 

ddd 


>>> Application: Why do big air bubbles ascend faster? (Fig. 3.55) 
Every diver knows: Big air bubbles ascend faster than small ones (Fig. 3.55). 
The rule of thumb is the following: Ascending faster than 20 m per minute 
should be avoided — and this also applies somehow to small air bubbles. 


Solution: 

We first simplify the situation and assume that bubbles are spherical. Then, 
larger bubbles paradoxically have — in relation to their volume — less surface 
and less cross-section. Due to Archimedes’s principle (Application p. 64), the 
buoyancy force equals the weight of the displaced water bubble, whereas the 
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Fig. 3.55 One major, but simplified, rule for divers: Do not ascend faster than the 
small bubbles — no matter what happens. 


cross-section is mainly responsible for the friction force. One could compare 
the situation with the free fall of small and big spheres of the same material 
with extreme air-resistance (Application p. 21): The big spheres, then, are 
faster than the small ones. 


Fig. 3.56 Ascending air bubbles change their shape all the time. The big bubbles 
roughly develop the shape of spherical caps and resemble the shape of jellyfish. 


When looking at photo series or videos of ascending water bubbles (Fig. 3.56), 
one notices two things: Firstly, big bubbles look like spherical caps (Appli- 
cation p. 411), and secondly they “explode” and form new smaller spheres 
every once in a while as they ascend. This might have the following explana- 
tion: Small bubbles have a spherical shape due to their surface tension. Since 
they ascend at a lower speed, they are barely deformed. When the bubbles 
get bigger due to decreasing pressure, the streaming dents the sphere on the 
back. 


® Remark: Application p. 77 explains that air (80% Nitrogen) is dissolved under 
pressure into the blood system of the diver. If the diver ascends too quickly, the 
air cannot be fully exhaled by the lungs and nitrogen embolism can cause serious 
damage. ® da 
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>>> Application: large and small people 

Two similarly constructed human beings have a mass of 50kg and 75kg, 
respectively. Since their masses (and hence their volumes) have a proportion 
of 1: 1.5, we can say the following according to the above statements (see 
Application p. 497): 


e Their body sizes form the ratios /1: 1.5 = 1: 1.14. Thus, if the lighter 
body is, for instance, 160cm tall, the heavier body will have a height of 
about 183 cm. 


@® Remark: A newspaper article from March 27, 2007: “The 2.36 m tall Chinese 
Bao Xishun has found the love of his life. The happy woman is a whole 68 cm 
shorter than him.” The next photo shows two reasonably equal slim people. 
The husband was about (2.36/1.68)? ~ 2.75-times as heavy at the time of their 
wedding. ® 


Your blood levels and liver capacities behave like volumes. So, if the lighter 
person is drinking 2 glasses of wine, the heavier person can drink 3 glasses 
and feels about the same effect. In practice, of course, other components 


play a role. 


A smaller person gets cold faster than a heavier person (bad ratio of Volu- 
me: Surface). One should always bear this in mind when going for a walk 


with kids in winter. PPP 


>>> Application: How many ants weigh as much as an elephant? 


Fig. 3.57 threatening gestures in different scales 


Solution: 

With the best of intentions, these two cannot be considered similar (Fig. 
3.57)! The following calculation is, therefore, initially wrong: The elephant 
and the ant have — like most living creatures — the specific weight of water. 
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Their lengths, therefore, behave like k » 3.5m: 7mm * 500. The ratio of their 
volumes (and masses) is k® # 500° « 125 million. However, the ant has a much 
more fragile build. The ratio is, therefore, greater. So, we adjust upwards and 
estimate: An elephant weighs as much as 1 billion ants ... 


® Remark: Now, let us continue with our estimations: The surface of the Earth is 
about 150 billion km”. Of these, 120 million km? are colonized by small animals like 
ants. Just eight ants per km? are sufficient to counterbalance an elephant. In the 
world, there are about 100,000 elephants. They are “counterbalanced” by 800,000 
ants per km? or 0.8 ants per m?. The total mass of all insects and invertebrates 
(worms, etc.) currently living on Earth is greater than the total mass of all other 
currently living animals! Among the mammals, bovine lifestock, i.e., cows, are the 
absolute leader. Cattle biomass is several times greater than that of humans. Cows 
even weigh more than all the ants on this planet taken together. Animal biomass 
accounts for only a tiny fraction of plant biomass though (less than 1%). This 
amounts to an estimated 2 trillion tons,® i.e. 2-10! kg. With an average density 
of water (1kg/dm*), this corresponds to a “water cuboid” with dimensions 22km x 
22km x 4km. (Our oceans are on average 4km deep.) Floods of this magnitude may 
be caused by an underground earthquake which results in a tsunami. ® da 


>>> Application: bound carbon 
Biomass can be measured in different ways, but in strict scientific applicati- 


Fig. 3.58 By far the most bound carbon can be found in plants (left: 15 m high 
trees, right: 30 cm high bushes). 


ons, it is defined as the mass of organically bound carbon (C) that is present 
in a given environment. The total live biomass of bacteria probably exceeds 
that of plants and animals.’ The further up animals are in the food chain, the 
more its total biomass tends to decrease. In oceans, biomass usually follows 
the food chain phytoplankton —> zooplankton —> predatory zooplankton 


6Maxeiner, Mirsch: Life counts, Berlin Verlag, Berlin 2000. 
“G. Glaeser: Nature and Numbers. Birkhaiuser/deGruyter, 2013. 
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— filter feeders —+ predatory fish. Terrestrial biomass generally decreases 
significantly at each higher level (plants —> herbivores —> carnivores). 

Trees, for instance, possess a vastly larger amount of biomass than all animals 
together. Fig. 3.58 on the left shows trees that are 15 meters tall. The right 
photo depicts bushes with a height of 30 cm. Thus, one can conclude that 
they should have roughly 100,000 times more biomass (50° = 125,000), as 
the trees are 50 times taller. <<< 
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Fig. 3.59 Stag beetles fight for a female. The air supply plays an important role. 


>>>» Application: Are larger animals preferred by evolution? 

(Fig. 2.51, Fig. 3.59) 
Male specimens of the stag beetle resemble each other, even if their size varies 
between 35mm and 80mm. Two males (50mm and 60mm length) fight for 
a female. How many times heavier or stronger is the larger one? Why might 
the stronger beetle still face a problem? 
Solution: 
The scaling factor equals k = 60/50 = 1.2. So, the larger males are k® » 1.73 
times heavier and as much as 1.2? = 1.44 times stronger (see Application 
p. 85). Yet, their air supply via the trachea may be 1.2 times worse. In 
addition, the stronger beetle will require more force to control its extremities 
(longer lever). 


® Remark: While you can let yourself be bitten in the finger by a male — since 
the lever of its antlerpliers is large, the same experiment is not advisable for much 
smaller females. These antlers must be short, because they act as effective pliers in 
rotten trees “where they eat their way” in order to lay eggs. ® 


® Remark: The playful fight (in Fig. 3.60) between two elephants is typically a 
“showdown” lasting for many hours in which body mass plays a crucial role. The 
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young bull Abu from the Schénbrunn Zoo in Vienna has a mass of about 1800 kg. 
His mother is — as can be estimated from the picture to the left — 25% taller, and 
thus, twice as heavy. Abu’s jostling ended fatally for a zookeeper in February 2005! @ 


a dwt “a 


Fig. 3.60 battle of giants: For non-heavy- Fig. 3.61 Collective mimicry: 
weights, this is very dangerous. Larvae of a beetle form a bee. 


>>> Application: collective mimicry (Fig. 3.61) 

In the Mojave Desert in California, there is a beetle (Meloe franciscanus) 
whose 2mm long larvae congregate in hundreds on top of a straw to col- 
lectively mimic the shape of a female bee and, thereby, develop additional 
fragrances to attract male bees. Before the bee discovers the mistake, many 
larvae will already cling onto its body, and they thus manage to get trans- 
ported to the bee’s nest, where they will feed. How many larvae can mimic 
the body of a 14mm long bee? 

Supplementary question: If the first “transport” takes 25% of larvae and the 
rest of the larvae again try to form a female bee, how long will this bee be? 


Solution: 

The aspect ratio is about 7:1. The larvae appear somewhat more elongated 
than a bee, so that it is difficult to set their shapes in relation. In order to 
achieve the volume of the bee, the ratio is 7? : 1 = 343: 1. Thus, there are 
several hundred larvae involved in this. If there are now 25% fewer larvae, 
the shrunken volume leads to a scaling of the figure by a factor of k = 3/4, 
and thus, the length changes by a factor of ¥/k # 0.91. 

The newly formed shape is now 12.7 mm long instead of the initial 14mm. <««< 


>>> Application: When and how did the universe originate? 

With super-telescopes, one can make out stars from the Earth even if they are 
some 3 billion light years away (practically, this means that one is looking into 
the past). The light of more distant stars is no longer able to travel through 


8See https: //www.nps.gov/moja/learn/nature/upload/201204M0JAscience. pdf 
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the atmosphere. With space telescopes (e.g., the Hubble Telescope), one can 
make out stars that have only 1/50 of the limit light intensity. Since brightness 
decreases with the square of the distance, you can thus, theoretically, see stars 
that are /50 * 7 times further away (more than 20 billion light years). 


® Remark: Since the universe originated 12-15 billion years ago by the “Big Bang”, it 
should be possible to observe this! Indeed, scientists presume that some images show 
only 300,000 years old parts of the universe. During the period, so it is believed, 
light was able to escape the “primordial soup” for the first time. @ da 


>>> Application: linear expansion of bodies when heated 

(Application p. 24, Application p. 384) 
A body is heated by the temperature At with volume Vj, so that its volume 
increases to V2 according to the formula 


Vo =V;, (1+ 7At). 


How big is the linear expansion? 
Solution: 


We have k? = 2 =1+/7At. The scaling factor for lengths (widths, heights) 
1 


L /V: 
is thus k = a a a The “expansion formula” is, therefore, 
1 


1 
Ly = L, Y1+ At. a 


>>> Application: “magnification” for macro lenses 

Before the advent of digital cameras, one thing was clear: A macro lens with 
magnification 1:1 could take a rectangle with a frame of size 36 x 24mm”. 
What are the rules for digital cameras now? 


The chips of digital cameras have different sizes today. The extremely expen- 
sive chips of professional cameras still have the “classic” format of 36x24 mm?. 
These are followed by mid-priced chips with about 22 x 15mm?. This allows 
us to fill rectangles of the same size using the same lens. 

Specifically: If I take a picture of a 24 mm long beetle using a professional 
camera with a large (expensive) chip of 8 megapixels and another one using 
a smaller chip (but also of 8 megapixels) then — assuming that the same 
lens is used — the image recorded by the smaller chip has a higher resolution 
(factor (36/22)? » 2.6)! However, one has to consider two things: Firstly, the 
quality of the picture does not only depend on the number of pixels, but is 
also strongly affected by the quality of the chip. Larger chips deliver (even) 
better results. Secondly, smaller chips indirectly extend the lens’s focal length 
(which is in fact linked with the classic size 36 x 24mm7), in our case by a 
factor of 36/22 * 1.6. This, in turn, causes a significant (quadratic) decrease 
in the depth of field (Application p. 54). The professional camera will still 
emerge as the winner ... <<< 
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>>» Application: sucker cups? (Fig. 3.62) 
Surface tension (Application p. 80) is similar to adhesion — which is the 
mutual attraction of particles to the surfaces of various substances. 
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Fig. 3.62 adhesion makes various substances possible 


Geckos have small extensions on each toe. These are composed of small, clo- 
sely spaced suction pads, which, in turn, carry up to a thousand branched 
hairs that are microscopically small (half a million per foot approximately). 
Thanks to this ingenious system, these animals can wait for prey on over- 
hanging walls. Yet, the exploitation of adhesion will soon reach its limits, 
because the increased contact area cannot keep up with weight gain. A lar- 
ger and heavier animal cannot have the incredibly large number of hairs that 
would be necessary on each foot to enable it to adhere to a wall. Besides, 
animals of increasing size lack the relative force that is needed to consistent- 
ly overcome of gravity. Nonetheless, there have been experiments with tape 
trying to imitate the “Gecko-principle”.® << 
>>> Application: Can large animals jump higher? 

Explain the following paradox: In principle, all animals with good jumping 
conditions (long jumps correspond to leg proportions and muscle distributi- 
on) have the potential to jump about two meters high. This is true for insects 
as well as large mammals. For small animals (with a body length of less than 
5cm) the limit height is strongly reduced by air resistance, and the resistance 
increases as the animal gets smaller (Application p. 21). 


Solution: 

First of all, when dealing with a high jump, we always look at the center of 
gravity, which is higher for a larger animal. We will treat the “standing height” 
separately in our calculations. Furthermore, in dealing with accelerations, we 
are always concerned with how long a force can act. 

We need the following rules: 

(1) The jump height depends on the square of the initial velocity. When the 
body has taken off, only the force of gravity acts. Duplicating the speed at 
the top means quadrupling the jump height! 


See www.welt .de/data/2003/06/03/106227 .html 
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Fig. 3.63 bounce in small and large animals 


(2) The initial rate is greater, the longer the acceleration through the muscles 
takes place. The duration of the acceleration depends on the length of the 
legs. 

(3) The acceleration results from the relation Force = Mass x Acceleration. 
We first substitute (3) into (2) and subsequently (2) into (1), so we find: 
(4) The jump height depends on the square of the expression 

(Force x Time)/Mass. 

The force, as we know, increases quadratically with increasing body length, 
the time depends linearly on the leg length, and therefore, linearly on the 
body length. The terms Force x Time and mass increase cubically as the 
length increases linearly. This leads to an amazing intermediate result (which 
we must subsequently adapt a little bit): 

The absolute jump height does not depend on the height of the body. 


® Remark: In fact, the skip of the small Galagos (small lemurs that have a mass of 
only one tenth of a kilogram and are also known as “bush babies”) is almost as high 
as a leopard’s jump. 

The “prehistoric horse” (Propalacotherium) was the predecessor of today’s horse and 
had a head-body length of 55 cm (with a shoulder height of 30 cm). If a present-day 
horse can lift its center of gravity by 120 cm and thus jump over a hurdle of 220 cm, 
then the Propalaeotherium could lift its center to the same extent (but it could only 
jump over 140 cm high hurdles). 

From this example, it is also clear why the Cuban Javier Sotomayor’s legendary 
world record high jump of over 2.45 m has remained intact since 1993. He must 
credit his success to his enormous size of 2.08 m, amongst other factors: It is not 
because taller people can jump higher, but because his center of gravity lies higher! 
In addition, he has very long and accordingly trained legs and a very slight torso. 
Now as regards small animals (of less than 5 cm): Even the widest jumping gras- 
shopper can reach only a quarter of the jumping height of larger animals despite the 
fact that these insects are true professionals in jumping. This is due to air resistance: 
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Fig. 3.64 Jumping force does not count here, but the exit speed does! 


A small beetle can achieve a falling speed of 3 meters per second. It cannot be more 
than that, simply because the air resistance slows them down immediately. Now 
going back to physics: If I throw a stone up at 3 meters per second, it flies as high 
as 45 cm! To throw the stone 2 m high, I have to throw it up at almost 7 meters 
per second! 

Even if a small insect were able to jump at 7 meters per second (theoretically, 
it could), it would be immediately slowed down by air resistance and its altitude 
would thus be drastically reduced. Therefore, it simply makes no sense for an insect 
to jump away with so much energy. 

Rat fleas, which are only 2 to 3 millimeters in size and were once feared as carriers 
of the plague, are truly gifted jumpers that can reach heights 50 times greater than 
their body size. Their top speed is less than two meters per second, though this is 
already their top speed in free fall. 

Dolphins and larger whales may skip several meters high out of the water. The ma- 
ximum jumping height is reached when the animals shoot out of the water vertically. 
Here it is all about the maximum speed under water, but not the bounce! Thus, if 
an arbitrarily large cetacean (or even a penguin) can make a jump perpendicular to 
the water surface at 10 m/s, its center of gravity rises about 5 feet from the water 
according to Formula (2.18). @ <<< 


>>>» Application: Do large aircraft need relatively less fuel? 


Solution: 

Assuming that both large and small commercial aircraft seem fairly similar, 
the question is quickly answered: Yes, they need a relatively smaller amount 
of fuel. A plane that can carry 320 passengers needs to be only twice as large 
as one that carries 40 people (40-23 = 320). Then, it has only four times the 
air resistance, and the per capita consumption is half as much for the small 
model. 

® Remark: Large commercial aircraft can fly even higher than small ones and they 


have a lower air resistance. On the other hand, the ascent requires an enormous 
effort, which is why the situation is clearly only suitable for long haul flights. 
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Numerical example: Lufthansa has indicated a fuel consumption of an average of 
4.4 liters per person on a 100 km route. The wide-bodied Airbus A380-800, however, 
consumes 3.4 liters. The planned new Boeing 787 is supposed to get through with 
as little as 2.5 liters. ® <<< 


Fig. 3.65 length runs ... 


>>> Application: in shipping: “length runs” (Fig. 3.65) 

Solution: 

One criterion for water resistance is the width of the hull, and it increases in 
similar constructions only with the root of the submerged hull. 

Another criterion is the area of the surface that is in contact with the water, 
and this decreases proportionally with increasing body length L. Occasio- 
nally, one even finds the formula for the maximum velocity Umax * 95/ L 
nautical miles/hour (= knot). 


® Remark: In a downhill race, much longer skis are used than in the technical 
disciplines, again because “length runs”. For the interview with the winner, the skis 
are replaced by shorter ones so that the brand is clearly visible inside the frame of 
TV screens. ® <<< 


>>> Application: transmission matters (Fig. 3.66) 
Explain how the depicted electric screwdriver works and determine its angu- 
lar velocity if the electric motor makes 3,500 revolutions per minute. 


Solution: 

Let us consider the inside of the screwdriver. A 3.6 volt battery (Fig. 3.66, 
left) provides the necessary electricity to turn the small electric motor with a 
speed of about 3,500 revolutions per minute. The motor shaft propels a small 
gear with only six teeth (Fig. 3.66, middle). This so-called sun gear transfers 
its rotational momentum to three rotationally-symmetric positioned plane- 
tary gears. The planetary gears are positioned so that they are connected to 
a fixed outer gear with 48 teeth. (Their number of teeth — in this case 19 
each — depends on the sizes of the sun gear and outer gear and has no direct 
impact.) This causes the centers of the planetary gears to rotate with one 
eighth (6:48) of the drive shaft’s angular velocity. 

The centers of the planetary gears form a fixed equilateral triangle. On its 
reverse, in a second step, a further 6-tooth sun gear propels three other 
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Fig. 3.66 left: electric screwdriver, middle: the small sun gear transfers its rotational 
momentum to three planetary gears, right: the axes are fixed, and the outer gear is 
allowed to rotate. 


planetary gears with the same transmission ratio as before. The connecting 
triangle between the planetary gears in the second layer is finally connected 
to the axis of the screwdriver. The 8-8 = 64 revolutions of the electric motor 
thus cause a single turning of the screw (roughly 3, 500/64 = 55 revolutions 
per minute). 

® Remark: This explains the large angular momentum M. Given the motor’s con- 
stant power P, we can apply the equation P = M-w, where w is the angular velocity. 
Planetary gears have many applications in technology, such as in gearboxes, cable 
winches, and bicycle hub gears. 


Another variation on the same principle is also notable: If the axes of the planetary 
gears are fixed and the outer gear is allowed to rotate, then it rotates significantly 
slower than the drive shaft (Fig. 3.66, right). ® <<< 


>>» Application: Where do the similarities lie? (Fig. 3.67) 
The fern on the left exhibits remarkable similarities to the right image of tire 
tracks in the snow. But where do the actual similarities lie? 


Solution: 

There is a middle line from which two sets of branches extrude at roughly 
constant angles. The similarity is further improved by the extreme perspecti- 
ve of the photo on the right, as the distances between the branches appear to 
shrink in the direction of the tip. The fern had to be photographed from the 
front in order to appear sharp in the picture. If we now compare the bran- 
ching distances, we will come to the conclusion that they follow an arithmetic 
series. By and large, each distance follows from the previous one by the addi- 
tion of a constant, and after a finite number of points, this distance converges 
towards zero. The tire tracks display a typical projective scale. The angle and 
composition is comparable to a picture of train tracks running towards the 
horizon. This means that, in theory, an infinite number of branching points 
would fit in the image. 
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Fig. 3.68 Live or dead matter (2)? 


® Remark: Fig. 3.68 represents an extreme comparison: On the left, you can see how 
plants sprout from gravel. On the right, you can see patina on a copper plate at a 
magnification of a hundred times (micrograph, Institute of Chemistry, University of 
Applied Arts Vienna): Dead matter might also be found on Mars — only at a small 
scale though, certainly not in any greater quantity! @ <<< 


>>> Application: the Reynolds number (Fig. 3.69) 

The correlation via the dimensionless Reynolds number Re = te describes the 
quotient of inertial force to viscous force acting on a body in a fluid (e.g., air 
or water).!° This number increases in proportion to both the object’s size 1 


10G. Glaeser, H.F. Paulus, W. Nachtigall: The Evolution of Animal Flight. Springer Nature, 
Heidelberg/New York 2017. 
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and the flow velocity v, and it decreases (indirectly proportionally) with the 
kinematic viscosity v of the fluid. The latter depends to a large extent on the 
temperature. At room temperature, for instance, it is 15 times greater in air 
than in water. Therefore, objects moving at a certain flow velocity in water 
have a Re number 15 times greater than objects moving at the same velocity 
in air. 


Fig. 3.69 Is there any chance to compare this mathematically? Same Re-numbers 
= comparable conditions! Left: Landing of a blue tit, right: water flea. 


If two bodies have the same Re numbers, they are comparable in terms of 
fluid mechanics. Let us say a water flea moves at a Reynolds number of 300. 
Then, its inertial force is 300 times greater than its viscous force. This number 
is derived from a body length of /=3 mm, a “jump” velocity of v = 10 cm/s, 
and the corresponding value for the kinematic viscosity of water. 


® Remark: If one wishes to study a model of a water flea (Daphnia pulex) that is 
ten times larger, the surrounding water must flow at one-tenth of the velocity in 
order to arrive at the same Re number and the same balance of forces. Alternatively, 
a wind tunnel could be used: Then, the model must be surrounded by air moving 
at a flow velocity that is about 15 times higher (1.5 m/s), which is easier to achieve. 
If the experiment succeeds, one could combine tests in water with tests carried out 
in air. One medium allows us to determine one parameter, while the other medium 
lends itself to determining another parameter. Such procedures are commonly used 
in fluid-mechanical measurements. @® <a 
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>>> Application: form follows function (Fig. 3.70, Fig. 3.71) 

The smallest hummingbirds and hummingbird moths may be compared with 
horizontal propellers, at least in terms of size, weight, wing shape (no curva- 
ture), wing surface and also wing load, wing motion, and the principle of lift 
to rise in the air. 


Fig. 3.70 form follows function: bird vs. insect 


It could also be said that they fly in the same range of Reynolds numbers. 
Conversely: Within the specified Reynolds number range, certain physical 
conditions need to be provided for an optimal hovering flight. These conditi- 
ons, which match the ones listed in the first sentence, have been met by two 
very distinct biological lineages. The biologist would consider this as a case 
of convergent evolutions and analogous structures. In technology, this would 
qualify as an example of the phrase “form follows function”. 


Fig. 3.71 form follows function: fish vs. mammal 


® Remark: Fig. 3.71 shows how mammals (dolphins) converged to shark design 
within millions of years. Sharks are, evolutionary speaking, much older than whales. 
Mesoniz, the ancestor of dolphins, was a terrestrial animal that went into the water 
to feed. ® <<< 


4 Angles and trigonometry 


In this chapter, we deepen our understanding of right and oblique triangles. 
We will also discuss various applications of angular or trigonometric functi- 
ons. 

The first part is the connecting tissue that binds the subsequent sections 
together. The Pythagorean theorem has been proven for 2,500 years — its 
underlying relation has been known for much longer than that, for it illumi- 
nates the relations between angles and trigonometric functions. The inverse 
of the theorem — namely that a triangle is right if the Pythagorean formula 
can be applied to it — is important for some proofs. 

In the second section, we will discover that mathematicians prefer to measure 
angles in radians rather than degrees. We will further show that our eyes 
tend to measure angles rather than distances, which has consequences for 
the subjective impressions of photographs. 

The following section deals with trigonometric functions and their inherent 
relations, and also answers the question of how these functions were used to 
calculate in antiquity. We will then discuss calculations in oblique triangles, 
which usually derive from applications of the Law of Sines and the Law of 
Cosines. This will allow us to solve various problems directly. 

The last section includes a vast array of practical applications of the principles 
discussed in the preceding material. 


© Springer International Publishing AG 2017 
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4.1 The family of Pythagorean theorems 


The Pythagorean theorem in the right triangle (catheti a, b, hypotenuse c) 
reads 


a? +b? =c’. (4.1) 
, 
c? 
c b 
b a 
Fig. 4.1 Pythagorean theorem Fig. 4.2 figure for the proof 


Proof: The proof behind the theorem is of such great importance that it has been 
conducted throughout history in over 200 (!) different ways. The following proof is 
among the easiest to understand. 

In Fig. 4.2, the area of the square can be written in two different ways: 


b 
A=(a+b)?=a7+2ab+b? and A=e +45, 


from which, through equalization, the formula (4.1) follows directly. © 

Fig. 4.1 illustrates that the sum of the areas of the squares above the legs 
is equal to the area of the square above the hypotenuse. The same applies 
to the three similar figures (such as half-circles) that are placed above the 
triangle sides. 

The converse statement of the Pythagorean theorem reads as follows: 


If a? + b? = c? applies to a given triangle, then this triangle is right. 


Proof: Let us conduct the proof “indirectly”. 

We assume that a? +b? =c? is true and the triangle is not right (Fig. 4.3). This 
leads to a contradiction. (In mathematics, if a statement is not false, then it is true. 
There exists a form of “black-white thinking” that is grounded in the principle of 
consistency, or the absence of contradictions. Conversely, a proof is not valid if at 
least one of its inferences is not true under all circumstances.) 

By inscribing the height h onto 6b, a triangle may be interpreted as the sum or 
difference of two right triangles AWB and BCH. In both, the (already proven) 
Pythagorean theorem applies, or 


(1) ce =(b4d)?+h? and (2) a? =d?+h?. 
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We subtract (2) from (1) and get 
c -—a? = (b4d)*-d? =? + Qbd. 


According to our initial assumption, however, c? — a? = b? holds. Thus, 2bd = 0 
and d = 0 must be true. This means that H = C and that the triangle is, indeed, 


right. © 


Fig. 4.3 proving the converse Fig. 4.4 measurement in a rice field 


>>» Application: Egyptian triangle (Fig. 4.4) 

The ancient Egyptians were aware that a triangle with sides 3, 4, and 5 is 
right. They took a rope, made 12 knots at equal distances, banded them 
together into a noose, and stretched it across three corners (at the knots 
0 = 12, 3, and 7). This enabled them to split their rice fields into right- 
angled sections. The front view of the Chephren pyramid actually consists of 
two Egyptian triangles! 

® Remark: The ancient Indians also knew of a right triangle with the side lengths 5, 
12, and 13. The search for right triangles with integer sides was of great interest to 
the Pythagorean School (Pythagoras 580 - 496 BC). Their solutions are commonly 
known as “Pythagorean triples”: 


[k(m? —n), 2kmn, k(m?+n?)],  k,m,n are integers 


k =1, m= 2, n= 1 produces the Egyptian and k = 1, m = 3, n = 2 the Indian 
triangle. ® <<< 


>>> Application: King’s chamber in the Cheops pyramid (Fig. 4.5) 

The Cheops pyramid was built more than 4, 500 years ago, that is, 2,000 years 
before Pythagoras. We find within it several examples of Pythagorean triples, 
one of which is found in the granite King’s chamber (Application p. 97). 
The measurements of the right triangle inscribed in the figure correspond 
to 15, 20, 25 Egyptian ells. The height of the chamber is not rational, but 
apparently corresponds to V15? — 10? » 11.2 ells. <<< 


The triangles ABC, ACH, and CBH (Fig. 4.6) are similar (equal angles). 
From the proportions ag: h=h:be, c:a@=a@: de, and c: b= 5b: be, we infer 
the altitude theorem and the cathetus theorem 


debe = ie and Cac = a’, Che = b?. (4.2) 
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Fig. 4.5 burial chamber, the only Cheops (Khufu) statue (12 cm), granite quarry Syene 
In addition, the two catheti theorems incidentally yield 
a+b = C (ae + be) = ce. 


Thus, the “Pythagorean theorem” is shown in a second way. 


Fig. 4.6 altitude and cathetus theorems Fig. 4.7 Thales’s theorem 


>>> Application: construction of the golden ratio and approximate 
construction of the number 7x (Fig. 4.8) 

Explain the two specified constructions of LJ in Euclid’s figure (see Appli- 

cation p. 37) or the approximate construction of 7 up to two decimal places 


using the method of d’Ocagne. 


112+ 1512 


Fig. 4.8 construction of two famous numbers 


Solution: 
For the construction of the golden ratio, we have 


AB’ = (1/2)? +12 =5/4 > AC = AB = V5/2. 


In order to verify the approximate construction of 7, we calculate 


DE =17+12=2=> DF=DE= V2, 
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FG =DF 41°23 FH = PC=3. 


=> DH =DF+FH=V2+V3%3.14627 (Error 0.15%). 


ddd 


Furthermore, the fundamental theorem holds: 


Thales’s theorem: Each angle in a semicircle is a right angle. 


Proof: Even this significant theorem — Thales (ca. 630 - 550 BC) was one of the 
first Greek mathematicians — has already been proved in numerous ways. One of 
these proofs indicated the equation as a special case of the “angle of circumference 
theorem” (Fig. 4.52). Here is another proof (Fig. 4.7): 


The triangles AHP and BHP are right: Thus, the Pythagorean theorem applies 
twice: 


AP’ = AH +HP =(r+2)?+y?, 


BP =BH +HP =(r-a)?+y’. 


Consequently, 


AP’ + BP =(r?+2re +224") +(r° Qrax +27 +") = 


= 2(a? + y*) + 2r? = 4r? = AB’. 
ae 
r2 


The Pythagorean theorem also applies to the triangle ABP. Thus, the triangle has 
a right angle. © 

>>» Application: road bridge (Fig. 4.9) 

Bridges often have arched structures. These arcs may be circular, parabolic, 
or “catenary shaped” (according to statics, the catenary below forms a “sup- 
port line arc” — see Application p. 430). In our case, let the arc be circular 
(which has the advantage that it includes congruent components). Given the 
span s and a maximum height h, we want to find the circle radius r and the 
column height ¢ at a distance x. 


Fig. 4.9 Bloukrans River Bridge, South Africa 


124 Chapter 4: Angles and trigonometry 


Solution: 
Following Thales’s theorem, the triangle AHT is right, so the altitude theo- 
rem (geometric mean theorem) is applicable (s/2)? = h(2r —h). By rearran- 
ging the theorem, we can now calculate r 
2 2 
5 = orh- ho r= ee, 
8h 

The pillar height at a distance x results from the auxiliary rectangular trian- 
gle PQM according to the Pythagorean theorem t = r—y=r-Vr2-ax?. «<< 


(4.3) 


>>> Application: tree of Pythagoras 


Even in antiquity, repeating the Pythago- 
lak. rean figure in Fig. 4.1 with recursive sca- 
~~" ling, i.e. fractal-like, as shown in Fig. 4.10, 
-h seemed like an obvious thing to do. The re- 
sulting figure shows no sharp contour line as 
1. the depth of the recursion grows. Zooming in 
- does not change this. A nice generalization 
can be found in the model below (Fig. 4.11) 
Fig. 4.10 the tree of Pythagoras designed by F. Gruber and G. Wallner. 


Fig. 4.11 a generalization of the tree of Pythagoras <<< 
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4.2 Radian measure 


In mathematics, one rarely computes with the familiar units of degrees (the 
subdivision of the circle into 360 parts comes from the Babylonians), but 
rather with radians (Arcus): 


The radian » of an angle w® is defined as the length of the arc on the unit 
circle corresponding to the central angle y°. 


For y® = 180°, we get the half circle with the length 7. Thus, 


= — 4.4 
p= 180° or p=~p—_: (4.4) 


Specifically, the radian y = 1 corresponds to the angle y° = 180°/m » 57°. For 
the corresponding circular arc on a circle of radius r, we have 


b=7¢. (4.5) 


Fig. 4.12 radians Fig. 4.13 calculation of the circumfe- 
rence of the Earth 


>>> Application: calculation of the circumference of the Earth (Fig. 4.13) 
Even the ancient Greeks knew that the Earth must have a spherical shape 
(this knowledge was then lost for a long time). Already in the 3rd century 
BC, Eratosthenes was able to make an astonishingly accurate calculation of 
the Earth’s circumference (!), and he did so as follows: The ancient city of 
Syene was located almost exactly on the Tropic of Cancer (near the present- 
day Aswan). It was probably well-known that there was a well in which, only 
at noon on the day of the summer solstice, one could see the Sun directly 
above one’s head. That is, precisely on that day, at that time, and at that 
particular place, the Sun’s rays hit the ground of the well vertically. By 
means of amazingly accurate measurements, Eratosthenes found out that, at 
the same time in Alexandria (800km north of Syene), the Sun’s rays had 
an angle of incidence of 7.2° (measured from the vertical). Assuming that 
the surface of the Earth is spherical, the following simple proportion can be 
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determined: 


b=ry => 800km =r (7.2° 535) > r~6,370km => U ~ 40,000km. 


® Remark: There had been indications that the Earth has a spherical shape since 
ancient times: Navigators who sailed toward the equator along the West African 
coast gave accounts of an ever-changing starry sky. The calculation of E'ratosthenes 
was obviously not a “proof” of the spherical shape, but demonstrated that the 
surface must be curved. Empirical evidence can be provided as follows: The first 
assumption was that the Sun’s rays are always “parallel”. 

In reality, the Sun is not a point light source and the various light rays form an angle 
of up to half a degree. If an objects casts a shadow, it is framed by a “penumbra 
aura” which becomes easier to see the farther the object is away. For the sake of 
simplicity in our calculations, we want to assume that the Sun’s rays are “parallel”. 
If we draw comparisons now with many other places, we will, of course, get the same 
result. Even then, the Earth could theoretically still have the shape of a “spindle 
torus” which arises when a circle rotates about one of its chords (the sphere is 
formed by rotating a circle about one of its diameters). You ought to consider that 
photographs of the Earth from space have only recently been available! © <<< 


>>> Application: What is a thumb width? (Fig. 4.14) 

To communicate the positions of objects that are far apart, we use terms like 
“thumb widths”, as in: “Do you see the black spot two thumb widths next 
to the striking group of trees”. Knowing the approximate distance d of the 
object, we can estimate its size. What is the size of an object corresponding 
to “one thumb width” at dm distance? 


Fig. 4.14 a “thumb’s width” Fig. 4.15 How far is it to the plane? 


Solution: 

With an outstretched arm, our thumb is r = 60cm away from our non- 
squinting eye. The thumb by itself is about 6 = 2cm wide. The radian of the 
visual angle vy is, therefore, y = — » — (the viewing angle does not change 

r 

much for bigger or smaller people, because even arm length and width remain 
roughly proportional to the thumb). This applies to the width of the “thumb- 
wide” object at a distance d meters, and thus, we have 
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Dam. 
30 


Conversely, we can estimate the size of an object whose distance is known: 
A house may have one side of length 12m and may be “half of a thumb’s 


width” high. From this, it follows that the house is d = 720m away, since 
Bm_1.1 


d ~ 2° 30° 
® Remark: One can also work with “palms”: One palm (the width of the hand at 
the transition of the metacarpus to the fingers) has a viewing angle of 8°, regardless 
of age and gender. 


The Sun and the Moon appear about less than half a degree in the firmament. If 


A 
the aircraft in Fig. 4.15 is 100m long, then since 100m = a r, the aircraft must 


be about r = 14km away from the observer. ® <<< 


>>> Application: How long is a mile? 


Solution: 

The length of a nautical mile is defined as the length of one arc minute (1/60 
of a degree) on the equator. Therefore, 40,000km corresponds to 360° = 
360-60’ and 1 nautical mile corresponds to 40,000 km/360/60 = 1.852 km. 

@ Remark: This unit at sea is actually very useful: Angles to stars can easily be 
measured on the high seas. Ships move, and when they go “straight”, the trajec- 
tory is seen locally as a great circle on the globe, that is, a circle with the same 
circumference as the Earth. 

One kilometer (1,000m) was previously defined in terms of the circumference of 
the Earth, namely as 1/40, 000 of it. Since 1983, the definition of the unit of length 
has depended on the definition of the second: A meter is the distance that light 
traverses in 1/299, 792,458 of a second within vacuum. ® <<< 


>>> Application: curved perspective (Fig. 4.16) 


Fig. 4.16 left: classic view; right: curved perspective 


If we place ourselves close to a “big” object, for example, in front of a wall (Fig. 
4.16, middle), then the visual angles (in the picture denoted by () change for 
the equally long segments AB and ApBo depending on our distance to the 
object. If we interpret the visual angles a and (£ as new units of length, we 
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get a completely different perspective image of a scene (Fig. 4.16, right) than 
we would obtain by classical photography (Fig. 4.16, left). It corresponds to 
the cylindrical projection of the image projected onto the spherically curved 
retinal image — or the subjective perception when the object is “scanned” by 
a rolling of the eyeball. Viewed mathematically, instead of plotting metric 
lengths, we simply lay off the arc length of the azimuth angle a and the 
elevation angle G. The vertical rod AB as shown in Fig. 4.16 (left) may be 
located at a = 60° to the right of the principal direction. The lower end point 
A lies in the horizontal plane through the eye (horizon). The bar appears at 
an elevation angle of 6. Which coordinates do the endpoints of the perspective 
image of the rod have? 


Fig. 4.17 Degas painted a curved perspective. 


Solution: 
The radians of a and 6 are 


7 7 T T 

a= 60 ino a 1.047 and $=15 130° 12” 0.262. 
Thus, A and B have the coordinates A(1.047/0), B(1.047/0.262). The image 
can of course be enlarged by scaling all of its lengths. Applying such a coor- 
dinate transformation to all image points, one obtains a curved perspective 
as in Fig. 4.16 (bottom right) or in Fig. 4.17 (right). 
® Remark: Some famous artists used this technology in order to produce curved 
perspectives (Fig. 4.17). At the expense of linearity, this would allow one to avoid 
an extreme perspective distortion (see also Fig. 4.16, right). 
Insects can see by conical facets. Therefore, they preferably measure objects in 
angles. The insect’s visual impression of a scene seen with compound eyes (Fig. 
4.18) is expected to correlate to the curved perspective just described. 
Other curved perspectives are produces by so-called “fisheye lenses” (Fig. 4.19, left; 
Fig. 2.2; Fig. 4.42), or by reflections in curved surfaces, especially in spheres (Fig. 
4.19, right). ® <<< 


>>> Application: area of a spherical triangle (Fig. 4.20) 
Let ABC be a spherical triangle, bounded by three great circles (center = 
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Fig. 4.19 fisheye perspective, perspective in the eye of the beholder 


center of the sphere). The sum of the three angles a, 3, and ¥ is always 
greater than 180°. Measured in radians, the “spherical excess” a+ 8+y-7 is 
then exactly the area of the doubly curved triangle if the sphere has radius 1 
(otherwise, the area increases with the square of the radius). Verify this 
formula for the blue area in the right image of Fig. 4.20. 


Fig. 4.20 The area of a spherical triangle equals the “spherical excess”. 


Solution: 
In the special case of the right image, all three angles are 90°, i.e. 7/2. The 
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spherical excess is then 37/2-—7 = 7/2. The triangle covers 1/8 of the sphere’s 
surface, which is (with r = 1) 47/8 = 7/2. 

@® Remark: Areas on the sphere are quite essential, because our planet is spherical. 
In order to determine the area of a continent, one has to work with “spherical 
triangulations”. ® <<< 


>>> Application: How often does Greenland fit into Africa? (Fig. 4.21) 
If you ask people to estimate estimate, the solutions offered to this question 
tend to be of the order of “four to six times” or so. The correct answer, 
however, is 15 times! This has to do with the fact that Africa is “around 
the equator” and Greenland is far North. The rectangular maps of the Earth 
distort Greenland much more than Africa. 


Fig. 4.21 The distortion in the rectangular map of the Earth makes area com- 
parisons hard. In the right image, another great circle takes over the role of the 
equator. Both Africa and Greenland are close to the equator, and the size compa- 
rison is more realistic. Compare also Australia and Antarctica in the left and the 
right rectangular map. 


® Remark: Even on the “usual” rectangular map, one can see that Africa and South 
America were once joined together. This, however, is only possible since both con- 
tinents are close to the equator. Australia and Antarctica used to be one continent 
as well, but this cannot be seen on such extremely distorted maps. ® <<< 
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4.3 Sine, cosine, tangent 


The trigonometric functions sine, cosine, and tangent play major roles in 
mathematics. They are the basis of all calculations based on angle measure- 
ment. 

Let us consider the unit circle: A point P on the circle can be described in the 
Cartesian coordinate system with the coordinates (x,y). The corresponding 
polar angle is y (Fig. 4.22). Then, we define: 


y sing 


v=cosy, y=siny, tanyp=—= , (4.6) 
x cosy 


Fig. 4.22 sine, cosine, tangent Fig. 4.23 sin(+y), sin(180° + y) 


The Pythagorean theorem implies the following important relation: 


sin? y + cos” yp = 1. (4.7) 


The ratio tang: 1 = siny: cosy shows that, on the plane, the line parallel 
to the y-axis (vertical) on the right is tangent to the circle at x = 1 (see Fig. 
4,22). 

The cotangent function is rarely used (because there is no such button availa- 
ble on pocket calculators). However, it is only the reciprocal of the tangent: 
1 cosy 


cot p = —— = — , 
tany siny 


From Fig. 4.23, the following relations can be read off: 


sin(180°-y)= sing, sin(-y) =-siny, 
cos(180° — y) = — cosy, cos(-y) = cosy, (4.8) 
tan(180° - y) = -tany, tan(-y) = -tany. 


We now consider the right triangle OP, P in Fig. 4.22. It is built 
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e by the cathetus A with length x adjacent to the angle y, 
e by the cathetus G of length y opposite to the angle y, and 


e by the hypotenuse H with length 1. 


The following holds true: 


cos Y = siny = <, tany = c (4.9) 


H’ 


These equations are valid in any right triangle with a leg angle y. Consider 
similar triangles! 


Thus, in right triangles, we have 


H G 
ial 


Furthermore, we can apply a = 90°-. By swapping the notations, we obtain 
the important relations 


A=Hocosy, G= Hsing. (4.10) 


cosa =sin(90°-a@) and sina =cos(90° - a). (4.11) 


In particular, since cos 45° = sin45° and cos? 45° + sin? 45° = 1, we have 
cos 45° = sin 45° : (4.12) 
= sin =—. . 
V2 
By virtue of half of the equilateral triangle, we immediately recognize 
fe} : ° 1 
cos60° = sin30° = 3 


and thus, since cos” 60° + sin? 60° = 1, we infer 


sin 60° = cos 30° = . 


“IS 


Overall, we have now obtained the following important values that should be 
memorized quickly: 


z|0°] 30°] 45°] 60° | 90° 
cosx | 1 | V3/2| V2/2| 1/2] 0 
sing | O} 1/2 | V2/2| V3/2| 1 
tanz | 0| 1/V3 il J/3| © 


A “memory hook” for the sign-values is the following table (the cosines are 
obtained by reading the table backwards): 


(4.13) 
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x 0° 30° 45° 60° 90° 
sinz | /0/2 | /1/2 | V2/2 | 73/2 | 4/2 
We do not have to know the following (harmonic) addition theorems (the 


Prosthaphaeresis Formulas) by heart, but they will be of importance when 
we deal with rotations in Section 7: 


sin(a+B) = sinacos6+cosasin£, (4.14) 


cos(a + 3) 


cosacos 3 - sinasin f. 


oY 7 


1 cos a + cos 3, sina + sin 9 
2 P*(cos(a +8), sin(a + 8)) R(cosa + cos 3, sina + sin 8) 
3 


Q(c08 8. sin gy 


} P(cos w , sin a) 3 


“8P(cos a , sina 
- 


asin 3 


y r=cosa 
y=sina 


a 


ysing 
—_ a 


£08 3 
= ~ 


Fig. 4.24 evidence of the harmonic addition theorems 


Proof: Let P..(cosa/ sina) and Pg(cos 3/sin 3) be points on the unit circle whose 
radii subtend angles a and ( with the z-axis. Now we turn to P; through the angle 
a and get P,(cos(a+ 3)/sin(a + 8)). According to Fig. 4.24 (left), its coordinates 
are 


Po+3(cos acos 3 - sina sin 3/ sin acos 6 + cos asin £). © 


>>> Application: trigonometric formulas 
Use the addition theorems in order to show 


1 
cos2%=1-2sin*z, and thus, sin 5 =+)/ 5 (1-cosz). (4.15) 


Proof: 


27 =1-2sin? x 


cos 2x = cos(x + 2) = cos? x — sin? x = (1 - sin? x) - sin 


1 1 
=> sinz = / ; (1 - cos 22) = sin =4/ 5 (1 - cosa). 


The actual names of the variables do not matter. ©) <<< 
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>>> Application: How were the tables for trigonometric functions 
created in early times? 

It is from a historical point of view that this question is of particular interest. 

Without these tables, only a few decades ago, we would not have been able 

to make exact calculations! The ancient Greeks used to create tables for the 

trigonometric functions which were afterwards taken over by Arabic mathe- 

maticians who refined them. Even the Vikings are said to have determined 

the latitude at sea with tangent tables. 

We will discuss how calculators and computers calculate values in the section 

on differential calculus and series expansion (Application p. 382). 


Solution: 
From the important values given in Table 4.13, we compute intermediate 
values using the addition theorems (4.14): 


sin 15° = cos 75° = sin(60° - 45°) and sin 75° = cos 15° = sin(90° — 15°). 


Through transition to the half angle formula (4.15), we get new values, which, 
in turn, means that the addition theorems lead to new values, etc. This was a 
lot of work, but one could determine arbitrarily many intermediate values in 
this way. If you wanted a more precise value, you could interpolate linearly 
between two neighboring values (the functions can only be replaced by a 
straight line in very small intervals). The tangent values are derived as the 
quotient of the sine and cosine. <<< 


Sometimes, you need the Simpson formulas (second addition theorems) 


sina+sinfB = 2 sin 27 F ogg 98 
+ _ 
cosa+cos8 = 2cos a 5 B cos = 5 :. (4.16) 


Proof: By virtue of Fig. 4.24, right: The points O, P, R, and Q form a rhombus 
with side length 1. R initially has the coordinates R(cos a+ cos 3/sina+sin (3). Half 
the diagonal OM has the length cos ae This also allows the coordinates of R to 
be specified by 


R(2 cos 28 /2 cos SS sin") 


cos 
2 2 2 


© 


>>> Application: three-phase current (Fig. 4.25) 

In the generation of three-phase currents, a magnetic rotor rotates within 
three induction coils and induces three dephased alternating currents. The- 
se may be used individually or jointly. Calculate the sum of all voltages 
y = sina + sin(x# + ) + sin(a + +) and the voltage difference between two 
consecutive phases y = sin x — sin(x + ) (see Application p. 177). 
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Yo= sin (x + 27/3) 


Fig. 4.25 Waves amplify or cancel each other. 


Solution: 

We show that the sum of two phase-shifted sinusoids (displacement x9, equal 
amplitude and frequency) is again a sinusoid of the same frequency but of a 
different amplitude: 

With the second addition theorems, we get 


+(x+ —(x+ 
x @ x0) cs ® (@ x0) 


sinz + sin(x + 29) = 2 sin = 2cos 5° sin(x + =). 


For Xo = =- (= 2cos $ = 1), we find the relation 
sin x + sin(a + aa = sin(a + a 
If sinz = —sin(x +7), then sin(a + 3) = —sin(a + =) and the sum is equal 


and opposite to the third function. The total sum will vanish: 


sin 4 sin(x | =) | sin (x | =) =0. (4.17) 


The voltage difference between two neighboring phases is calculated by 
sing — sin(# + x9) = sinx + sin(-z — 29) = 
xw-(x+2x0) 66s xzt+(x+2x9) 


=2sin = 2sin 3° cos(x + >). 
For xo = we have an increase in the amplitude by a factor of 2sin(—4) = 
-\/3. The voltage difference is described by 


y=-V3 cos (e+ 2) =-V3 sin(S— (n+ 2) = V3 sin(a- 7). 


In individual voltages with 230 volts, this results in a maximum voltage 
difference of about 400 volts. <<< 


20 
“a3 


>>> Application: inclined plane (Fig. 4.26) 
In order to avoid picking up a barrel of 100 kg one can roll it on a ramp, 
that is, roll it on a board with an upward inclination of a = 30°. What 
force F' = |F| is necessary (friction ignored) to compensate for the driving 
component T = |T'| of weight G? 
Solution: 

T =F =Gsina, G=mg = 100kg- 9.81m/s” = 981N, 


1 
=> F=981N.- 5 = 490.5 N. <<< 
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Fig. 4.26 inclined plane in theory ... Fig. 4.27 ...and in practice 


>>> Application: pyramid ramps 

The ancient Egyptians certainly used inclined planes (“ramps”) to lift heavy 
stone blocks (i.e. to draw them on sledges). In Application p. 74, we have 
seen that by far the greatest amount of cuboid stone blocks was used to 
build the the lower layers of the pyramids. For these lower layers, a ramp was 
obviously the best choice in order to move the blocks to the (still growing) 
height of the truncated pyramid. Ramps for greater altitudes were practically 
feasible. Obviously, the older architects worked with much more sophisticated 
methods (a “tipper” amongst other things). 

The quarry for the Cheops pyramid was about 400m south of the pyramid. 
What inclination angle should this ramp have had in order to raise limestone 
blocks to an 8% higher height? How long would such a ramp be to transport 
the last block of the pyramid, the “pyramidion”, to the top of the Great 
Pyramid? 


Solution: 
The slope of the ramp (inclination angle a) is tana = 8: 100. For the 400m 
long baseline, this results in a lift of 400- 3m = 32m. After all, this is a good 


fifth of the height, and about half of all blocks are positioned at that height. 
However, to transport even the pyramidion on the ramp, one would need a 
five times longer baseline (2,000m). Just the dams of the ramp would have 
required a multiple of the volume of the whole pyramid! <<< 


>>> Application: cable forces (Fig. 4.28) 

A symmetric wedge with the tip angle a is pressed with the force F' into a 
padding. How big are the normal forces F| and F > against the flanks of the 
wedge? 


Solution: 
F 
a F 
P= hy= = Fen 2" 
sing sin 5 
Therefore, F increases as sin y, i.e. @ decreases. <<< 


>>> Application: V-belt (Fig. 4.30) 
Let the radii be R and r and the central distance d of two circles be given. 
How long is the belt? 
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Fig. 4.28 cable forces in theory ... Fig. 4.29 ...and in practice 


Solution: 
Though a seemingly simple example, we will need a little trick that must be 


“seen”: By a parallel displacement of the common tangent segment of the two 
circles, we obtain a right triangle, to which the Pythagorean theorem applies: 
t=,/d?-(R-r)?. Furthermore, we have siny = (R-r)/d * (in radians!). 


Thus, the belt length is determined as 


L=2|R(F+e)+ter (4 ))- aad 


Fig. 4.30 V-belt Fig. 4.31 hipped roof 


>>> Application: hipped roof (Fig. 4.31) 
Calculate the volume of the space bounded by the hipped roof with eaves of 


given lengths a, b and given roof slope angles a, (. 


Solution: 
The altitude of the triangle Q12 equals 


a 
h==-t . 
5 an § 
Thus, the area A of the triangle P@2 is 


A=—. 
2 
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The volume equals the difference between, on the one hand, the volume of the 
three-sided prism with cross-sectional area A and length 6 and, on the other 
hand, twice the volume of the tetrahedron PQ2R whose base triangle PQ2 
also has the area A, and whose altitude R2 has the length d. The altitude d 
arises from the right triangle 12R with 


d=h/tana 
=V=Ab-25"= 4(b-=), da< 


>>» Application: cornering a motorcycle (Fig. 4.32) 

If a motorcycle (weight G = mg) with velocity v travels around a curve 
(radius r), it must be tilted by an angle a to avoid being offset by the 
centrifugal force (F = mv?/r). Show that a is independent of the mass? 


Fig. 4.32 cornering in theory 


Solution: 
We have 


ta = 
na=>=— 
G 


independent of the mass! In addition, the frictional force must be greater 
than F’. 


Numerical examples: 

1) From the picture (Fig. 4.33, left), the inclination angle a and the curve radius r 
are known. How fast was the moped traveling on the path of the garden way? 

a= 15°, r=10m> v= Vgr tana 5m/s* 18km/h. 

2) How big must the angle be if v and r are given as such? 

r=70m, v= 72km/h = 20m/s > tana * 0.58 > ax 30°. 

3) The pictured extreme carver (Fig. 4.35) tacks a curve with a radius of r = 5 m 


at a speed of v = 12 m/s. This results in tana = 2.88, which means that the board 
has to be inclined at an angle of 71° in order for the carver to maintain his balance. 
More than three times the Earth’s gravity is exerted on the carver. “Touching the 


> as in the photo is a physical necessity, given a standard slope inclination of 


snow’ 
20°. The athlete is forced to literally drag himself across the snow if he is to master 


the curve. << 
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Fig. 4.35 extreme snowboarding 


>>» Application: development of a cone of revolution (Fig. 4.36) 

A cone of revolution (radius r, height h) transforms by development (“flatte- 
ning”) into a circular sector of radius s with the central angle w. Compute w 
in relation to half of the cone’s angle a of aperture. 

Solution: 

According to Fig. 4.36, the arc length of the cone’s base circle must match 
the arc of the circular sector. Therefore, 


( 
Qnr=sw>w=27-. 
s 


r 
With — = sina, we obtain the formula 
8 


w=27r sina or w° = 360° sina. (4.18) 


The second variant of the formula arises from the first through multiplication 
by 180. << 
>>» Application: crank slider (Otto-cycle engine) (Fig. 4.37, left) 

The piston drives the crank pin A via the pin B and the push rod AB, and 
this causes the rotation of the crank shaft DL. Calculate the distance LB as a 
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y(a) 


Fig. 4.36 development ofthe Fig. 4.37 petrol engine, time-distance diagrams 
cone of revolution 


function of the rotation angle a. This distance is the key to the entire kine- 
matic analysis of the transmission (velocity and acceleration distribution). 


Solution: 

The distance y = LB is, of course, dependent on the angle of rotation a, and 
this, in turn, is proportional to the time ¢. Thus, we obtain the path-time 
diagram of B via the equation y = y(a(t)) =v): 


LB=LH+ HB, 


LH =r cosa, AH =rsina, HB=\Vs?-AH , 


y= LB=r cosat+ Vs?-r2sin7a. 


Fig. 4.37 on the right shows such ae y = y(a) for different ratios r: s. 
For s >>r (say: s much bigger than r), we get 


V 52 —r2sin? a=sy/1- ree ars 


and the path-time diagram approaches an ordinary sinusoid. The graphs of 
y = sinz and y = cosz differ only by parallel displacement along the z-axis 


T : F : : 
by 5 Therefore, in both cases, we are dealing with a sine curve 
y=Pr cosats. 


The demo program otto_engine.exe on this book’s website was created 
using y(a@). For mechanical engineers, an additional velocity diagram is still 
important. This is obtained, as we will see in the chapter on differential 
calculus, by means of the derivative function (Application p. 276). <<< 


Quite often, we need the following formula: 


sin 2a = 2sinacosa. (4.19) 
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Proof: We consider the right triangle ABC in Fig. 4.38 (with AC = 1). It has the 
area 


I. 
A, = 5 sin a cos @. 


We reflect it in AB and obtain an (isosceles) triangle ACC* of twice the area (area 
Az = sinacosa). The height CH in this triangle is sin2a and this allows us to 
calculate the area Ag in the following way: 


1-sin2 
Ag = ae = sinacosa =2 Aj}. 


© 


>>» Application: area of a regular n-gon (Fig. 4.39) 
Calculate the area of a regular n-gon with circumradius r. 


Fig. 4.38 sin 2a = 2sinacosa Fig. 4.39 area of a regular n-gon 


Solution: 
The following holds: 


8 . . 360° 
~=rsiny, h=rcosy with p=—. 
2 2n 
From this, the area is computed 
s 3 ee 9sin 2p 
<a -h=nr* sinycosy=nr 
nr? _ 360° 
A = — sin —. (4.20) 
2 n 


® Remark: We let n “grow” to a large value. Then the area of the polygon “converges” 
to mr’, that is, the area of the circle of radius r. Even Archimedes, more than 2,000 
years ago, proceeded in a similar manner — namely, via the circumference of the 
regular n-gon — and approximated the number 7 quite well. 

If we use the “limit notation”, then we have 

360° og 


Das ae 
A,=r~ lim =sin =r un. 


noo 


After changing to radians (360° = 27), we arrive at the following by cutting out r? 
and dividing by 7 

: n . Qn 

lim — sin — = 1. 
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: : 2 ae 
If we write x instead of = (so x converges to 0, as n tends to infinity), then we get 


n 
an important relation that we need in differential calculus: 


1 sin x 
lim —- sina = lim =, (4.21) 
z>0 @ x-0 @ 
In words: 
For very small angles, the radians equal the corresponding sines. | ® <<< 


>>> Application: upward inclined throw and maximum throw distance 
Calculate the height h after t seconds by modifying Formula (2.17) in App- 
lication p. 38: 

h = G(t) - B(t) = (vosina)t- 50. (4.22) 


Herein, vg and g are again top speed and acceleration due to gravity. The 
initial angle to the horizontal direction shall be denoted by a. The air resi- 
stance is neglected. Calculate the throw with a given vp and a. For which a 
can the best throw be achieved? 


y,h 


Git) 


Fig. 4.40 inclined upward throw 


Solution: 
By solving the quadratic equation (4.22), we obtain 


1 
t1.2 = =(v0 sina +/v2 sin? a, - 2gh). 
g 


The highest point is reached for D = ve sin? a — 2gh = 0. Thus, 
ve sin? a 
2g 


= at the height H = 
g 


at time to = 
The total time of the throw until the impact in the plane h = 0 equals 2to. 
For the horizontal component (the xz-coordinate of the thrown object), the 
following applies: x(t) = tugcosa. From the total time of the throw, the 
throw distance is computed as 


2sinacosava 
g 


1,4. 
w = 2tgvop COS a = = vasin 2a. 
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Ta 


Fig. 4.41 Galileo’s experiment Fig. 4.42 A high long jump or a long high jump? 


if 


The theoretical maximum throwing distance is obtained from the maximum 
of sin2a. The sine curve has a maximum value of 1 at 90°. From sin 2a = 1, 
we deduce 2a = 90°, and therefore, a = 45°, in order to obtain the maximum 
throwing distance. 

® Remark: The air resistance has to be taken into account, at least since Galileo’s 
experiment with balls of iron and wood on the Tower of Pisa (Fig. 4.41). The 
trajectory is then a cubic parabola. When throwing a rounder’s ball, a shot put, a 
javelin, or when attempting a long jump (Fig. 4.42), it is not possible at an initial 
angle of a = 45° to obtain the greatest initial velocity v9, mostly for physiological 
reasons. Yet, the initial velocity is even more important than the initial angle: The 
throw (displacement) grows quadratically with v9. Therefore, the best sprinters are 
often the best long jumpers (Carl Lewis). ® <<< 


Equations of type Psinx + Qcosx +R=0 


This type of equation is quite common in practice (Application p. 144, Ap- 
plication p. 335) and leads to a quadratic equation: 
First, we rearrange 


P sing = -Q cosz - R= P” sin? x = (Q cosa + R)? 
= P* (1-cos* x) = Q? cos? x + R? + 2QReosz. 
If we use the abbreviation u = cosa, then we have the quadratic equation 
(P? + Q?)u?7+2QRu+ R?- P?=0 


with the two (sometimes coincident and sometimes not real) solutions for 
U = COB 2: 


-2QR +2,\/Q?R? + (P? + Q?)(P? = R?) 
2(P? + Q?) 7 
QR+/P!+ P?Q2-P?R2 -QR+P\/P?+ Q?2- R? 
P2+Q? ~ P2+Q? ; 


U1,2 
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>>> Application: intersecting homofocal conics 
In Chapter 7, we frequently need the solution of the equation (for example, 
in Application p. 335, Application p. 336) 


ry T2 


l+e, cose 1+¢€2cos(x+6) 


It describes the intersection of two conic sections with one common focal 
point. The constant value 6 is the “twist angle” of the two conic sections. 
Deduce the equation by means of the addition theorems in order to obtain 
P sinx+Q cosx+R=0, the solution of which is well-known. 


Fig. 4.43 intersection of homofocal conics (right: confocal conics with Fh = Fy) 


Solution: 
Multiplying by the common denominator, we first get 


ro(1 +e, cosx) =171(1+€2 cos(x+0)). 
We set cos(x + 6) = cosa cosé — sinzsiné = c cosx —s sinx. Then, we have 
rot+7rgé, COST =71 +71 €2(c cosx—s sinz) 
which yields 


(r2€1 —T1€2¢) cosxtriéegs sinv+rg-71 =0. 
ey ae —— 
P Q R 


® Remark: In this context, one speaks of an orthogonal grid (Fig. 4.43, right) built 
by the two pencils of confocal conics (see Application p. 534). ® <<< 
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4.4 The scalene triangle 


x Big ahi) CP BU oy 


= (WEYA)Y 
Pye years (2EWA)Y ». 
a | r= (avt\)¥ -. 
ye 3 sl) GU Cad 
ee ees (aet\)US2 
Fig. 4.44 sum of angles Fig. 4.45 from an Arabic textbook ... 


For an unambiguous determination of a scalene triangle, we need exactly 
three specified elements. 

Exception: The specification of the three angles a, 6, and ¥y is not enough, 
because, even among the three angles, the following relation holds: The sum 
of the angles in a triangle is 180°, that is, 


a+ B+ = 180°. (4.23) 


Proof: We draw the parallel to AB at C' (Fig. 4.44). There, a and 6 appear once 
again as parallel angles. © 
Incidentally, this relation yields sin y = sin(180° — (a + BY) and thus, 


sin y = sin(a+ £). (4.24) 


® Remark: The so-called elliptic geometry deals with spaces where one can draw no 
parallel to a straight line. There, the sum of angles in a triangle is always larger 
than 180° and not the same for all triangles. This is not just a theoretical gimmick, 
but plays an important role in modern physics (“curved space”). ® 


Now, let the sides and angles of a triangle be known. We calculate the auxi- 
liary parameters he, Ca, and cy (Fig. 4.44): 


he = asin 8, cg =acosB, cy =Cc- Ca =Cc-—acos 8. (4.25) 
Now, we apply the Pythagorean theorem: 
c} +h? = b* = (c-acosB)* + (asinB)? = b* 


= c’+a7(cos” 6 + sin? B) — 2accos 6 = 0’, 
Sh 
1 
which leads to the 
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Law of Cosines, a generalization of the Pythagorean theorem. The follo- 


wing three formulas differ only by a cyclical shift of the variables: 


b = c+a*-2cacosB, 
Ce = a®+b*-2abcosy, (4.26) 
a? = 0b? +c? -2becosa. 


In order to break away from notations, we should keep the following sentence 
in mind: 


The square on one side is equal to the sum of the squares of the other two 
sides, reduced by twice the product of the other side lengths times the cosine 
of the enclosed angle. 


From the side lengths of a triangle, the angles can be calculated. For example: 


b2 +c? —a@? 


COS @ = 
2bc 


>>> Application: forces in equilibrium (Fig. 4.46) 
Three coplanar forces (i.e. forces acting in a plane) 


F, = 60N, Fo =45N, F3 = 30N 


attached to a point are balanced. Calculate the angles which they mutually 
subtend. 


Solution: 


a= 60, b= 45, c= 30 => Law of Cosines > 
a= 104.5°, 6B =46.6° > 
yp=a+P8=151.1°, w=180° -a=75.5°, ¢ = 180° - 8 = 133.4". 


ddd 


If two sides and the enclosed angle are given, one also uses the Law of Cosines. 


Fig. 4.46 forces in equilibrium Fig. 4.47 Belgian roof trusses 
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>>> Application: Belgian roof trusses (Fig. 4.47) 

The angle ¢ of inclination of the bottom chord and the angle 6 of the roof’s 
inclination with a total chord length AB = 2s are given. We are looking for 
the lengths of the other bars. 

Solution: 

Once we have found the rafter’s length AC = a we know the two side 


lengths AB and CA of the triangle ABC and the enclosed angle a = 6 —€. 
The side BC is then found via the Law of Cosines: 


BC =4/ 4p 2 AC ABAC wae, 


BD = 2(s— AB cose). 


>>> Application: intersection of two spheres (Fig. 4.48) 

The intersection of two spheres 41, “2 (centers My), Mo, radii r1, re, central 
distance d = M,Mz) is a circle k with radius r and center M in a plane 
normal to the axis M,M2. Compute r and e = MM,. For which d is r = 0 
and r? <0 (or r not real)? 


Solution: 
In Fig. 4.48, we can see a scalene triangle M1 M25S with known side lengths 
r1, rg, and d. We apply the Law of Cosines and get 


Oa so 
rtd’ -r5 


2rid 


ra 7 r? +d? -2rid COS @ => COS Q@ = 


Fig. 4.48 intersection of two spheres (special and general view) 


The following holds: 


DD AD ghd 
ret+d°—-1r5 


2rid 


<1 


cosa<1> 


= 24d? — 72 < Qnid > r? -IQryd+ d? <r? > (r,-d)* < 12. 
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Fig. 4.49 the coming into being of soap bubbles consisting of spheres 


For (r; — d)? > r3, the circle of intersection will not be real. For 
ry-d=4rg>171-1r2=dorrm+r2=d, 


the two spheres touch at a point (“zero circle”). In the case of a real inter- 
section, the radius r and the distance e from the center point M to Mj, are 
related by 

r=7r,sina and e=r, cosa. 


® Remark: The plane that carries the circle of intersection is called the radical plane. 
The intersection of two spheres is always a circle, regardless of whether it is real 
or not. By virtue of Application p. 534, we note that all spheres share the absolute 
circle of Euclidean geometry, which is not only at infinity, but also carries no real 
point. ® << 


Area of a triangle: 
First of all, the area A of a triangle (with the notation from Fig. 4.44) equals 


_ che _ cbsina 
~ 2-2 


A 


Since the labels can be shifted cyclically, we also get 


Au aesing _basiny _cbsina (4.27) 
2 2 2 
From (4.27), we immediately deduce the 
Law of Sines: 
ce one an (4.28) 
sina sin@ siny 


In words: 


The ratio of side length and the sine of the opposite angle is constant. 
The ratio of any two sides is equal to the ratio of the sines of opposite angles. 
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Thus, the following tasks can be solved: 
Task 1: Let two angles (and thus, all three) and a side (for example, a) of a 
triangle be given. Then, the side b equals 


‘ a 
b= sin 6 ——. 
sin @ 


>>> Application: height of a tower (Fig. 4.50) 

At A, one measures the angle ¢ from the roadside to the base F’ of the tower 
and the elevation angle 6 to the top H. Then, one moves along the roadside 
to position B at distance s = AB and again measures the angle ¢ to the base 
F. Thus, the height of the tower is determined: 

1) The angles ¢, y, ~ sum up to 180°. Thus, w = 180° -e-y and sinw = 
sin(é + y). 

2) We apply the Law of Sines to the triangle ABF: 


x 8 siny 


s ; 
sin(é+¥) 


3) The altitude h in the right triangle AF'H satisfies 


. = . > 
sing sinw 


a ay eee 
x 


A By B 
Fig. 4.50 height of a tower Fig. 4.51 two equal solutions 


Task 2: An angle (for example, a) and two sides that do not enclose this 
angle (for example, a and b) are given. Then, we can calculate the angle 6 
by 
sinf=b = 
a 

It has to be taken into account that this formula can have two solutions (or 
only one or none at all), namely 3 and 180° - 6 (Fig. 4.51). In practice, one 
solution can often be neglected due to an additional condition. 


>>> Application: forces in equilibrium 
Two forces F; = 60N and F» acting on the same point enclose the angle 
y = 151.1° and result in a force Fy = 30N. Calculate Fy! 
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Solution: 
With the usual notation, we have 


a=60, c= 30, 


c a . siny 
>sina=a 


= 0.967 => 


siny sina 

ay » 75.2°, ag » 180° — 75.2° = 104.8° > 
B1 = 180° - a, -7=75.9°, Bo = 180° - a2 - 7 = 46.3°, 
a a 


8 60.2, ) = sin 62 : 
sIn 1 s1n 2 


b; = sin By x 44.9. 


Thus, we see that there are two equally correct solutions! An additional 


condition could help us eliminate one of the two solutions! <<< 


An important result related to the scalene triangle is the angle of circumfe- 
rence theorem (Fig. 4.52). It is a generalization of Thales’s theorem: 


Angle of circumference theorem: A chord of a circle forms the angle 7 from 
each point on one part of the circle and the angle 180° — y from each point on 
the supplementary part of the circle. The central angle for the supplementary 
arc is twice the central angle on the primary side. 


Fig. 4.52 angle of circumference theorem Fig. 4.53 diagonals in an n-gon 


Proof: Consider a circle with the fixed chord AC and another point B on the 
circle. With the notations of Fig. 4.52, we get: The sum of interior angles in the 
quadrilateral 1B2M equals 360°, as in any quadrilateral, for it can be divided along 
a diagonal into two triangles. It has right angles at 1 and 2. Thus, the remaining 
angles sum up to 180°, €; + €2 = 180° - @. Furthermore, 2(¢) + €2) = 4 AMC, and 
therefore, 3 = 180° - 34AMC for each point B on the circle on the same side. On 
the other side, the angle 2 AMC is obtained by taking 360° - 2 AMC, and we have 
2 AB'C = 180° — 8. Since 2 AMC = 360° — 28, we further have 2 AMC =22 AB'C, 
that is, the central angle is twice the angle of circumference. © 
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The angle of circumference theorem leads to the following statement: 


In a cyclic quadrangle, i.e. a quadrangle inscribed into a circle, the sum of 
opposite angles equals 180°. 


Almost simultaneously, the following statement has thus been proven: 


The diagonals in a regular n-gon make equal angles with the neighboring dia- 
gonals. 


Proof: Let P be an arbitrary point, and let Q and R be two neighboring points of 
the n-gon (Fig. 4.53). In the circumcircle, we can apply the theorem of the angle 
of circumference with the chord QR. Thus, 2QPR does not depend on the actual 
position of P. Further, 2QMR = a = 360°/n is constant. According to the theorem 
of the angle of circumference, the central angle is twice the angle of circumference, 
and thus, 2QPR = a/2 = 180°/n is constant. © 


>>> Application: positioning by angle measurement (Fig. 4.54) 

From the map, we know the relative position of three distinct points A, B, and 
C on the ground. In order to determine our own position P, it is enough to 
measure two angles (for example, yp = 4 APB and y= 4 BPC). P is located 
on two circles according to the theorem of the angle of circumference. <<< 


It is very easy to prove the following theorem with the help of the Law of 
Sines (Fig. 4.55): 


Each interior angle bisector of a triangle divides the opposite side in the ratio 
of the adjacent side lengths. 


O () a 
A(-vi0) V /w " Biuld) x 


Fig. 4.54 positioning Fig. 4.55 reflection in the circle 


Proof: According to Fig. 4.55, we should have wu: v = a:b. In the triangles BWC 
and AWC, we get the following after applying the Law of Sines 


U a v b 


a and —— =— 
sinw sing sinyw  sin(180° - y) 
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Fig. 4.56 four reflections in a circle... Fig. 4.57... or a cylinder of revolution 


Since sin(180° - ~y) = siny, the relation wu: v = a:b already holds. ©) 


>>> Application: reflection in a circle 

We find an application of the above theorem if we consider a “reflecting” circle 
centered at W on the angle bisector through C’. Then, C is the reflection of 
A in k with respect to B. Or in other words: A laser ray emanating from A 
is reflected at C to B. We fix a Cartesian coordinate system at W. Then, the 
points A, B, C have the coordinates A(-v/0), B(u/0), and C(a/y) (where 
a? +y? = 77, and thus y? =r? - we), Furthermore, according to the law of 
reflection (angle of incoming ray = angle of outgoing ray) and the above 
theorem on the bisecting line, we find 


V(z+vPty? ov (atv) ty _v? _ (atv) +r?-2? _v’ 
J(a-ut+y? wo (w-u)?ty uw (w-u)?+r29-a? uP 


Quxtvu2 +r? v2 2 2 2 2 2 2 
- => uo (Qua +0%4+7*) =0*(-2ux+ue+r 
-2ua+u2+r2 ut ( ) ( ) 
=> Quv(ut v)¢ = 7?(v? - u?) > & = one, 
2uv 


The y-value is calculated via the Pythagorean theorem as y = Vr? — x?. 


Fig. 4.58 Without knowing the possible maximum number of reflections (in this 
case two, because the viewer is outside the cylinder), it is not easy to understand 
the individual pictures of this series. The time difference in between the four pho- 
tographs was 1/60 s each. This also helps us to understand the situation. 
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® Remark: The determination of the reflection requires only the solution of a qua- 
dratic equation, although in the general case (Fig. 4.56), reflecting in the circle leads 
to an irreducible equation of degree 4. This is because, in this case, two reflections 
of A in k& are already known. Trivially, these are the intersections of k with the 
straight line AB. ©® <<< 


Fig. 4.59 mirror anamorphosis 


>>> Application: mirror anamorphosis (Fig. 4.59) 

When looking at a point P in a cylindrical mirror (eye point £), the reflection 
point P* is unequivocal. In principle, the task is still two-dimensional: In a 
top view (right), we look for the reflection point P*’ of P’ in a circle with 
respect to E’. How can one draw distorted images in the base plane such 
that the reflection “makes sense”? 


Solution: 

We put the cart before the horse and start with the result P*. Then, we just 
have to reflect the ray P* F in the cylinder (we reflect at the circle in the top 
view, and the inclination of the ray stays the same). Further, we intersect the 
reflected ray with the base plane and get the desired point P, the reflection of 
which is P*. Note that points “behind” the cylinder will never have a visible 
reflection point. << 
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4.5 Further applications 


In this section, you will find some interesting exercises relating to the previous 
sections, but they can also be skipped without affecting the understanding 
of the overall concept. 


>>» Application: rolling cone (Fig. 4.60) 

When a cone of revolution is rolling on the base plane, it will stay inside a 
circle around the apex. We now choose two parallel circles on the cone. Let 
the ratio of their radii be 1: k, and d be their distance measured on the 
cone’s generating line. What is the radius s of the inner blue circle in Fig. 
4.60? Compare also Application p. 154. 


Fig. 4.60 The rolling of the cone can be substituted by the rolling of two rigidly 
connected parallel circles of the cone. 


Solution: 
Due to the intercept theorem, we have s : r = (d+) : (kr), and thus, 
s=d/(k-1). «a 


>>> Application: Why does a train not derail? 

This application does not need calculations, but has something to do with 
arc lengths. The answer “The train does not derail, because it runs on rails” 
is too simple. 


Fig. 4.61 conical wheels 
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Solution: 

The train would continually derail if pairs of co-axial wheels were not conical. 
Even the steel ring on the inside of the wheel cannot prevent derailment. It 
is mainly the slightly conical shape of the wheels (Fig. 4.61) that can do 
so: If the train drifts to the right, then the contact circle on the right wheel 
apparently increases its diameter due to the conical shape of the wheel. At 
the same time, the diameter of the contact circle on the left wheel decreases 
(causing a tilt of the axis). This is exactly what forces the axis to move back 
into the right and stable position. 


oN 


Fig. 4.62 No problem with taking the left curve: The smaller circle of contact on 
the left and the larger to the right make the train stay on the tracks. 


® Remark: When the train has to take a curve, the conic wheels are also of great 
advantage. Imagine the left curve in Fig. 4.62: The tracks have to be inclined, 
otherwise the centrifugal forces would tip the train over. These centrifugal forces 
will still shift the train to the right. This causes the circle of contact with the right 
track to increase, and the corresponding circle on the left to decrease. This is good, 
because now the train automatically tilts to the left (Fig. 4.60). Inclining not enough 
to the right will push the train further right, whilst inclining too far to the right 
will push it back to the left. The perfect equilibrium lies somewhere in between. ® 


ddd 


Fig. 4.63 The Sun and the Moon both appear under a viewing angle of # 0.5°. 


156 Chapter 4: Angles and trigonometry 


>>> Application: diameter of the Sun and the Moon (Fig. 4.63, Fig. 4.64) 
The Sun and the Moon can be seen in the sky at almost exactly the same 
angle, namely a little more than half a degree (this was not always so, because 
the Moon was closer to the Earth in ancient times). Moreover, since the 
ecliptic plane of the Moon is tilted only slightly towards that of the Sun, 
solar eclipses frequently occur. Through precise angle measurements from 
various points on the Earth, the average distance between the two celestial 
bodies can be calculated: dg * 150,000,000km, djy ~ 384,000km. What are 
their diameters? 


JA 


Fig. 4.64 Partial solar eclipse in May 2003: blended pictures over the course of 
approx. 20 minutes (5:09-5:31). The Sun and new Moon start “leaving each other”. 


Solution: ' 
Either visual angle can be expressed in terms of radians as y » ——. Now, 


we compare with the diameter D of the Earth (* 12,740km) and find 


Ds = dg vy * 1,300,000km * 100 Dz, 


Dy = dy vp ® 3,350 km # 0.26 Dz. 


The exact values are 
Dg =dgp%1,392,000km, Dy =dyy*3,476km, 


because the angle is slightly larger than 0.5° and the distances — especially 
that of the Moon — also vary. 


@ Remark: A total solar eclipse occurs relatively infrequently. A total lunar eclipse 
can be observed more often, because, in this case, the Earth shadows the much 
smaller Moon (1/4 of the Earth’s diameter). Therefore, lunar eclipses take even 
longer, because the Moon takes hours to “dive through” the shadow zone. From the 
Moon, this phenomenon must look extremely spectacular: The huge “new Earth”, 
then, begins to glow red at the edges (more in Application p. 315). ® <<< 
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>>> Application: landing (Fig. 4.65) 

An airliner lands at sea level within 20 minutes of coming from an altitude 
of h = 35,000 (1 foot = 0.314m = h» 11km). What is the average descent 
angle when the average body speed (ground speed) is 540km/h? 


Fig. 4.65 smooth landing on the water surface 


Solution: 
The total distance is s = 540km/h- sh = 180km. For the descent angle a, 
we have tana = h/s. For small angles, the tangent is equal to the angle 


in radians (Application p. 383). By multiplying, we get the degree value: 
Oo _ 11km . 180° 35° 
180km T Ses 


® Remark: A spectacular “jump” with special “bat suits” between the continents 


of Africa and Europe in the summer of 2005: The jump started at an altitude of 
35,000 feet at an outdoor temperature of —50° above the Moroccan coast. Landing 
took place in Algecira Bay in Cadiz, with a ground distance of 20.5km. The gliding 
duration was 6 minutes. The descent angle a was approximately 30°, since tana * 
11: 20.5. The average speed was 240 km/h (204km/h ground speed). The parachute 
was opened at the last minute (see Application p. 21). ® <<< 


>>> Application: “curvature” of the sea (Fig. 4.66) 

The surface of the sea, as well as that of larger freshwater lakes, is visibly 
curved. By connecting two distant surface points A and B with a straight 
line, the “vault” can be defined. What is the vault w of a lake with a length 
of [km? What height h must a tower at B have in order to see the middle 
of the lake? 


Solution: 


Half the central angle y (Fig. 4.66) is calculated in radians from the Earth’s 
radius R = 6,370km by 
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Fig. 4.66 curvature of the water surface, curved horizon 


Furthermore, MN = R cosy and MC = and so 
cos 
L 1 ‘ : 
w=R(1-cos =), h=Rh z 71) (y in radians!). 
2R COS 57 


Numerical example: 

D=10km>w=1.96m, hew, 

L=30km => w = 17.66m, he w, 

L=100km > w = 196.23m, he w, 

L=200km => w = 784.91m, h= 785.01m, 

L=750km > w = 11,035m, h = 11,054m. 

Thus, we see that, for a small L, the curvature and the tower height are almost 
identical. 

® Remark: A final numerical example: An airplane at an altitude of 11km can be 
seen under ideal conditions from 750 km/2 = 375km. Conversely, it is only from this 
distance onwards that the aircraft can be detected by radar. Looking at the sea 
with a wide angle lens, one sees a slightly curved horizon of approximately 500 km 
in length. The curvature itself accounts for about 11km. This view is captured by 
the photograph of Fig. 4.66. @ <<< 


>>> Application: “morning star” and “evening star” 

(Fig. 4.67) The planets move on elliptical orbits around the Sun. Except for 
the rather eccentric elliptical orbit of Pluto, the orbital ellipses are almost in 
one plane, the ecliptic. The orbital ellipses of Venus and the Earth are almost 
circular (medium distance to the Sun 108 million km and 150 million km). 
What is the maximum viewing angle, under which one can see the Earth, 
the Sun, and Venus? Why is Venus the “morning star” or “evening star”? 


Solution: 

From Fig. 4.67, one recognizes a right triangle ETS, the sides of which are 

formed by the tangent to the orbital circle of Venus, the axis Earth-Sun, and 

the radius at the contact point 7’. For the angle a in this triangle, we get 
TS 108 


—— Sree 0.72>ax 46°. 
ES 150 


sin @ 
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Fig. 4.67 Sun, Venus, Mercury 


Venus is, thus, always seen at an angle of at most 46° around the Sun. 
Therefore, you see it only just before sunrise or just after sunset. 


® Remark: Still closer to the Sun is Mercury. With an average distance of 58 million 
km, the calculation yields the maximum visual angle a x 23° (because of the more 
eccentric path, this angle may, however, be up to 27°). Yet, since Mercury is much 
smaller than Venus, it generally appears less bright in the firmament. The Sun, 
Mercury, and Venus are almost on a “straight line” (or more precisely on a great 
circle in the sky): the Earth, the Sun, Mercury, and Venus are, indeed, almost in 
one plane (the ecliptic), and this plane almost appears to “project” from the Earth. 
In Fig. 4.67, the positions of Venus and Mercury in the firmament are labeled by 
V* and M*.@ <<< 


>>> Application: lapping a planet (Fig. 4.67) 

The Earth has an orbital period of 365 days around the Sun. Venus has one 
of about 225 days, and Mars takes 1.88 years. All orbits lie almost exactly in 
a plane and the orbital rotations are equally oriented. How frequently do the 
planets have the same relative position (for example, minimum distance)? 


Solution: 

Assuming that the orbits are circular, the Earth orbits through an angle » 1° 
per day around the Sun. For Venus, it is » 1.6°. That is, Venus orbits ~ 0.6 
faster and needs about ae = 600 days to return to the same relative position. 
Mars loses * 0.47° per day with respect to the Earth. Therefore, a similar 
position is reached after 766 days. 


® Remark: In early June of 2003, two Mars probes were launched so that, when 
Mars reached its closest position to the Earth at the end of August, the probes 
had travelled about “half” of their trajectory. In that year, Mars was particularly 
close to the Earth, because, during its slightly elliptical orbit, the lapping occurred 
when Mars reached its position closest to the Sun (“perihelion”). Since Mars — in 
comparison with Venus — is farther away from the Sun than the Earth, we can see 
its fully lit side during a lapping and Mars thus becomes the third brightest celestial 
body after the Sun and the Moon. @ <<< 
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>>> Application: How long does a solar eclipse last? 

When the Moon is exactly between the Sun and Earth, there may be a total 
solar eclipse. However, the observer on the Earth has to be in the umbra of 
the Moon (Fig. 4.68). This approximately circular area has a diameter of at 
most 269 km. Calculate the velocity with which the shadow moves over the 
surface of the Earth. 


Solution: 

Fig. 4.68 illustrates that it is essentially the speed of the Moon on its or- 
bit around the Earth that matters. Within 27.52 days, the Moon travels 
27 - 384,000 km, i.e., about 61 km per minute. Thus, the area of the umbra 
would rush within 4.5 minutes over the observer. However, the observer on 
Earth rotates relatively quickly about the Earth’s axis (which is in general 
slightly inclined to the connection Earth-Sun). The maximum speed is the 
path velocity at the equator (1,667km/h, which is around 28 km per minute, 
see Application p. 171). Thus, the speed of the umbra decreases to a value 
between 61km/min and 33km/min. The total solar eclipse can, therefore, 
take 269/33 ~ 8 minutes in extreme cases. <<< 


Fig. 4.68 the umbra of the Moon wanders over the Earth 


>>> Application: determination of an inaccessible radius (Fig. 4.69) 

In order to determine the radius R of a cylinder of revolution, one can use 
three round bars of equal thickness with a diameter d. Then, we measure the 
height difference h between the outer and inner bars. What is the formula 
for R if d and h are given? 

Solution: 

We refer to Application p. 123, where we have derived (with the notation of 
Fig. 4.69) Formula (4.3): 


82 + 4h? 
8h 


Since d = M, Mo, we find the distance s = M,M3 = 2Vd?-h2. The radius 
that we are looking for is thus 


4.5 Further applications 161 


2 722 2 2 
pers d Ae) Ha od Pd _ diay 
2 8h 2 2h 2 oh 


® Remark: Instead of measuring the height difference h, one could also measure the 


2 


total width b of the three bars. Of course, it is s = b—d, and thus, h = y / d? - (5) = 


p_0- 


effect on h, so that the result is less reliable. 


. A small measurement error in b may already have quite a strong 


Numerical example: 
Variant 1: d= 20mm, h = (4.440.1)mm > R= (5541) mm, 
Variant 2: d= 20mm, b= (59.0+0.1)mm > R=(55+2)mm. © <<< 


Fig. 4.69 determination of the inacces- Fig. 4.70 aligning the bounding box by 
sible radius of the red cylinder image rotation 


>>> Application: tilting the horizon ... (Fig. 4.70) 

For the aesthetics of a photograph in which the horizon can be seen, it is im- 
portant that the horizon is parallel to the margin. It is best to pay attention 
to this at the time the photograph is taken. If necessary one could instead 
rotate the image through an angle of 6 by using image processing software 
afterwards, risking a loss of quality. Subsequently, the largest possible rec- 
tangular part with the same proportions is cut out. How much space (pixels) 
does one lose? 


Solution: 

Let the proportions of the original photo be a x b cm?, where a: b is usually 
3:2in SLR cameras and 4: 3 in compact cameras. Then, we get the following 
with the notation of Fig. 4.70: e = 1/2Va? + b?, e = arctanb/a. 

The oblique auxiliary triangle MEE” is drawn in green and uniquely defined 
by e, 6, and e. The scaling factor of the cropped image is found via the Law 
of Sines: 

k =e" :e=sine: sin(180° -<- 6). 


The dimensions of the new rectangle are, then, a* = ka, b* = kb and the loss 
of area (loss of pixels) is (a* -b*) : (a:b) = k2. 
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® Remark: For a rotation angle 6 = 10°, the loss in the 3:2-format is 35%. In the 
4:3-format, it is 32%. For turning angles of about 20°, we lose more than half of the 
pixels (up to 70%). So far, we have not taken into account that a rotation through 
an arbitrary angle will result in a loss of quality (only rotations through multiples 
of 90° are free of quality loss). ® <<< 


>>> Application: jumping high 
Compare the two photos in Fig. 4.71. 


Fig. 4.71 comparable situations 


Solution: 

The height of a jump depends almost exclusively on the take-off velocity. 
The following principle is well-known from physics: In the absence of air 
resistance, if something falls down from h meters, it reaches a speed of v = 
V2gh. Thus, if something shoots up with a vertical speed of v, it reaches 
the height h = v?/(2g). For instance: If a motorcycle takes off at a speed of 
14 m/s and at an angle of 30°, the “vertical velocity” is 14-sin 30°m/s = 7m/s. 
The motorcycle will, then, reach a height of h = 49/20 m (= 2.45 m). 


eS = 


Fig. 4.72 These images show the proportion of the humpback whale and its enor- 
mous fluke, which exerts tremendous forces to speed up the giant. 


® Remark: 2.45 m is the current world record for high jumping. However, due to 
the jumping technique that is employed, the jumper has to lift his barycenter by 
about 1.50 m, which leads to a vertical speed of approx. 5.5 m/s. Therefore, he has 
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to initiate the jump at a velocity that allows him to reach the necessary vertical 
speed. 

Humpback whales are famous for jumping out of the water. They usually get out by 
a little more than half of their length (about 8 m). Their barycenter, however, does 
not need to be lifted quite as much. If the whale could speed up to 10 m/s and hit 
the surface in a perpendicular position, it would theoretically lift its barycenter by 
5 m. However, this is not enough to lift its entire body out of the water. ® <<< 


>>> Application: throw into deeper ground (Fig. 4.73) 

In Application p. 142 (Fig. 4.40), we have calculated that an initial angle of 
45° yields an optimal throw. This is only true on a horizontal ground. When 
throwing in a downhill terrain, the optimal angle may be smaller. 


Fig. 4.73 tilted throw into terrain 


If one uses the time parameter t, a point P(a/y) on the ballistic parabola 
has the coordinates x = ugt cosa, y = ug t sina - gar. 

The first equation can be solved for t, which can then be inserted into the 
second equation: 


x sina g 2° 
t= >Yy=2 a ee 
Uo COS cosa 29 .cos*a 


Thus, the trajectory has the form 


g 


—_ b = tan Qa. 
2u? cos? a’ 


y=ax?+be with a=- 
Now, we intersect two trajectories with different launching angles: 


yy = 127 + biz, Yo = anu” + box, 
Y=y2> ax" + bx = ax? + box => (a1 = az)x” + (by = bg )a = 0. 
The trivial solution x = 0 is the launch point. The non-trivial solution is 
obtained by dividing through by x(a; - a2) # 0, which yields 
Xs = (bo = by) : (ay = a2). 


If the terrain’s slope permits such a throw at all, the flatter ballistic curve 
laps the steeper one. <<< 
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velocity v1 


velocity v2 


Fig. 4.74 The wave front plausibly explains the law of refraction of light rays (the 
wave front tilts upon hitting the layer of separation). 


>>> Application: Snell’s law (Fig. 4.74, Fig. 4.75) 
From Fig. 4.74, we obtain 


BO, vy BO, AC ; : 
n> ——= =n: —=sinag,:sinag=n. 


ACz 2 AB AB 


According to the law of refraction (Snell’s law), the angle a of the incoming 
ray and the angle 8 of the outgoing ray (measured to the normal of the 
surface) are related by sina: sin 6 = 4:3 for the transition from air to water. 
There is, due to the atmosphere, no total reflection. That is, all light rays 
are at least partially deflected into water. 


ice layer ice layer 


blind angle blind angle 
-p Bo “ 


Fig. 4.75 total reflection Fig. 4.76 situation in the water ... 


Conversely, there is a critical angle 89 at which no light passes from the water 
through the surface. The critical angle 89 is obtained from the maximum 
value of 1 from sina: 


1 4 3 
== > sin fs = => Bye 486°. 
sinB 3 aps gr 


® Remark: A seal that floats on the blind spot (Fig. 4.75) cannot see the Inuit who 
is waiting for it at the air hole in the ice: The only light rays that lead to the hunter 
are shielded by the ice. Even the Inuit sees the seal only when its nostrils stretch 
out of the water (Fig. 4.76)! @ <<< 
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Fig. 4.77 Who sees whom Fig. 4.78 total reflection outside of I* (on the 
now? This question is by no left the track circuit of I* is seen) 
means trivial! 


>>> Application: Who sees whom? 
In the calm pool of water (Fig. 4.77, left), the fish A (the diver) sees: 


“everything” outside the pool, albeit it is distorted greatly. The refracted 
image is inside a circle k on the surface. This circle is the intersection of 
a cone [' of revolution with an aperture angle of 2 x 48.6°; 


the total reflections of those parts of the basin which are outside the 
reflected cone I*, e.g. the fish C (particularly clear in Fig. 4.78, left); 


reflections of the rest of the basin (Fish B) in the interior of k — as a result 
of partial reflection (the calmer the surface of the water, the clearer the 
image); 


e the fishes B and C, even directly! 


On a photograph from outside the basin (for instance, from a diving board) 
you can see all the fishes — to some extent severely distorted. <<< 


>>> Application: refraction at a spherical surface 
Use the approximation sinx » tanz * x for small x (Application p. 383) in 
order to deduce the Formula (4.29) for the refraction at a spherical surface. 


Fig. 4.79 refraction at a spherical surface 
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Solution: 
With the notation of Fig. 4.79, the relations of exterior angles 6 and ay, are 


(1) B=a2+y7, (2) a, =60+8. 


If c, and c are the velocities of the propagation of light in the two mediums, 
then by the law of refraction (Application p. 378), we have 


m=C,:C2=Ssina,:sinag*a,:a2 = (3) ao* an. 
This leads to 

B= “(5+ 8) +4 => 6+B+ny=nB => 6+ny=(n-1)8£. 
Further, the following holds: 


, tanyx ys 


? 


31a 


adeve ©, sinGx Bx a 
g b 


and thus, we find the approximation 


+—= : (4.29) 
g Ob r 
Here, r is the radius of the sphere. g and 6 are the object distance and the 
image distance. The lens laws in this simple form are valid for only small 
angles y = 6 in the vicinity of the optical axis (Gaussian space), which are 
marked by green rays in Fig. 4.79. The orange rays at a greater distance from 
the axis envelope a “focal curve”, and no longer pass through the focus of the 
lens (Fig. 2.10, left). <<< 


>>> Application: How does a rainbow form? (Fig. 4.81) 
A good moment to see a rainbow is when the Sun is low after a downpour 
(Fig. 4.80). Why? How does a rainbow form? 


Solution: 

The wavelengths of the spectrum of visible light range from 380 to 780 nano- 
meters, i.e. slightly less than one thousandths of a millimeter. The individual 
spectral colors are less broken during the transition to an optically denser 
medium if their frequencies are smaller. Therefore, violet is refracted more 
strongly than red. Beyond red, we have the warming infrared rays. Beyond 
violet, we have the harmful ultraviolet rays. 

Now, after a downpour, the air is full of (spherical) water droplets (Fig. 4.81). 
These, we usually see in a diffused form as mist or cloud. Let r be the direction 
of the incoming light rays. At any point of the illuminated hemisphere, a light 
ray is partially reflected at the hemisphere, while the other part is partially 
refracted inside the hemisphere (+ ray r;). The ray is “fanned” into the 
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Fig. 4.80 Two concentric rainbows (primary and secondary). Left: two “real” rain- 
bows were captured from a terrace with a fish-eye lens during a low sun position. 
These shimmering water droplets are many kilometres away! Right: this photo was 
taken with a 30 mm wide angle lens. No rainbow was present in the sky — rather, 
a jet of water was shooting closely past the camera in the shape of a parabolic 
trajectory, producing two clearly visible rainbows. 


different spectral colors due to the different refractive indices. The ray rj; 
meets the sphere from the inside and is partially refracted, partially reflected 
(> ray r2). The ray rg again meets the spherical wall and is partially refracted 
(> ray r3), partly reflected where the fanning is partly undone, but is still 
maintained. The outgoing ray r3 is a further fanned bundle of rays. The 
reflected residual light “wanders further around”, whereby the light ray loses 
intensity by splitting into reflected and refracted light. 

It now seems that r3 is mainly responsible for the rainbow effect. To be more 
precise, it is responsible for the primary rainbow. A further reflection and 
subsequent emergence produces a secondary rainbow (Fig. 4.80). This was 
known by René Descartes (1596 - 1650) almost 400 years ago! 


Ray 3 is partly 
being reflected “q., 


Ray 3 nearly tangential 


ca. 20°-30° —~ 
\ Ray 2 further away 
Ray 1 exits barely 


. 
. 
*. A 
refracted and with es 
only slight loss Zea as 
sn agoearet age ic 


Ray 1 near the centre 


Water droplets 
Ray 2 exits more 
strongly refracted [it’s 


barely being reflected : 
in the sphere wall) “a. ca, 40°-44° 


Ray 3 exits twice in a 
refracted state and 
once in a mirrored state 


Fig. 4.81 emergence of the rainbow 


If we send not just one but an infinite number of parallel rays of light into 
the sphere (Fig. 4.81), most of the rays — being refracted in every direction 
— will leave the water droplets at the back. Those rays that arrive at the 
backside of the droplet approximately at the critical angle of total reflection 
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will return relatively highly fanned towards r3. Fig. 4.81 illustrates that, at 
a certain exit angle (a ~ 43°), there is a distinct maximum of such rays r3." 
For each partial spectrum in the entire spectral range — for each color — this 
angle is slightly different (Violet ...42°, Red ...44°). Viewing the droplet at 
this angle, an appropriate amount of color prevails. 

Thus, all rain drops that “dispatch” rays r3 at the angle a are seen in the 
corresponding spectral color. All these droplets are distributed on a cone of 
revolution whose apex is the eye and its half aperture angle equals a. Thus, 
this cone of revolution appears in a “projecting” manner, and therefore, it 
appears as an arc (Fig. 4.81). The human eye is able to distinguish seven 
essentially different colors, with red on the margin. In the secondary rainbow 
— it occurs for @ x 51° — the order turns back because of the additional 
reflection! 


® Remark: Depending on the position of the Sun, one can see at most a semicircle. 
Looking out of a plane (above the clouds), you can see a whole circle if the Sun is 
high. Since the cone is dependent on the position of the observer, one always sees a 
new rainbow when walking. So, it makes no sense “to look for the treasure that is 
buried at the foot of the rainbow”. ® <<< 


Fig. 4.82 red sunset with de- Fig. 4.83 The underlying theory shows: We can 
lay of five minutes ... see around the bend! 


>>> Application: Why does the setting Sun appear to be Red? 


Solution: 

If the Sun is low on the horizon, the light rays, which contain the entire visible 
spectrum (see Application p. 166), transit the atmosphere at equal angles. 
The atmosphere becomes denser as it gets closer to the Earth’s surface. The 
rays are, thereby, increasingly fanned into the different colors, with violet 
being the most bent towards the front and red the least. Theoretically, the 


1Start the demo program rainbow. exe to see the animated situation. For more theoretical details 
on this issue, see the following interesting website: 
https://plus.maths.org/content/rainbows 
See also Steven Janke: Modules in Undergraduate Mathematics and its Applications, 1992. 
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portion of light that is blue reaches us for a longer period of time than the 
red component (Fig. 4.82). However, due to its shorter wavelength, this blue 
portion is scattered across the atmosphere (which is why the sky is blue). 
So, when the Sun goes down under the horizon, you can see the color red for 
about five minutes (Fig. 4.83)! 


® Remark: Thus, for about five minutes, we have the impression that an object 
is located in the extension of the light rays reaching the eye. Eventually, the red 
portion of light is no longer able to make it “around the bend” and the Sun goes 
down, seemingly deformed to an oval. © <<< 


>>> Application: Fata Morgana? (Fig. 4.84) 

This particular sunset deserves a closer explanation. Both images to the left 
look relatively familiar. From the third image onwards, however, something 
like a reflection appears to emerge. This reflection is then increasingly dis- 
torted in the subsequent images, even when the Sun has already disappeared 
completely. 


Fig. 4.84 3.5 exciting minutes 


Solution: 

A Fata Morgana is an optical illusion caused by the refraction of light in 
layers of air of differing temperature and humidity. As previously mentioned 
(Application p. 168), the Sun should already have disappeared from sight 
several minutes before the picture was taken. Due to the low angle of inci- 
dence, the blue components of light with their short wavelength were already 
fully absorbed by the atmosphere, while the red components still managed 
to go around the bend. Yet, what causes the strange reflection? The specular 
points are not infinitely far away, but are instead located on the surface of 
the water. In fact, due to the curvature of the Earth, they can be much closer 
than one might expect. Depending on one’s height above water level, it is 
possible to see anywhere from several to 100 kilometres ahead (Application 
p. 157), though the latter distance is only possible during clear atmospheric 
conditions. The images on the right were taken a few seconds after the Sun 
had disappeared completely. The fisherman at sea, however, would still have 
been able to see it. The sunrays which hit the surface of the water at the 
fisherman’s position could have been detected by us, although they might 
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have been slightly refracted, owing to the different atmospheric conditions 
of temperature and humidity. The series of photos was taken on the Cape 
of Good Hope at the end of February. On the northern hemisphere, at the 
same latitude, the Sun sets clockwise in the direction west-north-west (App- 
lication p. 305). On the same day on the southern hemisphere, the Sun sets 
in anti-clockwise direction towards the west-south-west. Note the encircled 
tree which appears to move one Sun diameter to the right within a span of 
3.0 minutes. aa 


>>> Application: optical prism (Fig. 4.85) 

The refraction of light and its dispersion into spectral colors can be beauti- 
fully demonstrated by using a three-sided glass prism. In the photo on the 
right, one can see three rainbows. What is the explanation? 


Fig. 4.85 three different rainbows on the right 


Solution: 

At first, the white sunrays that incides on the right (1) are being dispersed 
as on the left. Due to refraction, this does not appear at the location where 
we might expect it (2). The further dispersed light rays eventually exit the 
object (3). This spectrum is reflected in the left surface (4), but it also reaches 
the eye through the right surface (5) after multiple refractions. Due to the 
varying angles of incidence, the spectrum appears as a curved rainbow, and 
unlike a common rainbow in the sky, the red colour appears inside. 


® Remark: The two almost identical reflections of the grasshopper sitting on a mirror 
and of the gecko on the glass window can be explained as follows: In both cases, 
we have two parallel planes. The light is partly reflected in a “classical” manner on 
the plane the animals are sitting on. Other parts of the light enter the glass zone 
that lies behind (refracted). If the incidence angle is small enough, a total reflection 
on the parallel plane occurs. The light is finally “inversely refracted” and leaves the 


first plane parallel to the directly reflected one. When the thickness of the reflecting 
material is very small, this can even create interferences. ® <<< 
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Fig. 4.86 two almost identical reflections 


>>> Application: an hour-long sunset 

On what circle of latitude is the speed of an airliner (* 800km/h) sufficient 
so that, during a flight towards the west, the apparent position of the Sun 
does not change? 


Solution: 
The Earth rotates over a period of 24 hours. The orbital speed vp of a point 
4 k k 
on the equator is, therefore, calculated as a = 24h or vo = 1,667 a 
U0 


(According to Application p. 127, this corresponds exactly to 900 nautical 
miles per hour.) An aircraft flies at this speed over the equator from east 
to west. The Sun’s position does not change. However, vg exceeds the speed 
of sound, and almost all passenger aircraft fly well below this limit. The 
circumference, and thus also the radius r, of the latitude corresponding to 
this circle must, therefore, be smaller than that of the equator: the scaling 
factor 1057 = 0.48 yields r = 0.48 R. On the other hand, the radius of the circle 
of latitude can be calculated according to the formula r = R cosy, where y 
is the (northern or southern) latitude. Thus, cos y = 0.48 => y = 61.3°. 


® Remark: The southern tip of Greenland is somewhat north of this geographical 
latitude and flights from Europe to North America — depending on the weather 
conditions and jet streams — operate in this passage because of the shorter distance. 
So, it is not unusual that during such a transatlantic flight, the Sun “stops” for 
hours on the horizon. The Sun can even “walk backwards” for a short time when 
the latitude is exceeded and/or the travel speed is increased. ® <<< 


>>> Application: true and mean time, time zones 

In true time, 12 noon is defined as the time when the Sun is at its peak. Ac- 
cordingly, all points with the same geographical longitude should have equal 
local times. However, because Earth does not move uniformly due to Kepler’s 
laws through space, the true midday thus varies by +15 minutes. Therefore, 
one uses the term mean noon or the average local time. For practical reasons, 
time zones were introduced (average times for larger areas). So, Central Eu- 
ropean Time is the average local time for locations on 15° E longitude. Since 
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the Earth rotates about its axis in one hour at 360°/24 = 15°, it corresponds 
to the summer time in Eastern European Time (30° E longitude). 


® Remark: The People’s Republic of China extends from 74° E to 135° E, i.e. more 
than 60 degrees of longitude. This would “normally” be enough to cover four time zo- 
nes. However, for whatever reasons, China has decided to introduce “Beijing time”.? 
A sunrise at a specific time (Beijing Time) “somewhere in China” can, therefore, 
take place within an interval of four hours of true local time. Now, if the working 
time of approximately 8:00-17:00 were given, it would mean that one would have 
to start work in the dark for most of the year in western China. ® <<< 


Fig. 4.87 If you mark the points on the celestial sphere where the sun is at, at 
a certain hour, they form an asymmetrical 8-loop. More on this can be found on 
www.uni-ak.ac.at/geom. 


The following two final applications of this chapter stem from triangle geo- 
metry and are more theoretical than applications in this book usually are. 
They are, however, good examples of applications of theoretical results in 
proofs. 


>>> Application: the Law of Sines in a challenging proof 

Triangle geometry is an extremely “mature” discipline of science. Making a 
sensational discovery in this field in our day and age is close to a miracle. 
In the year 1904, the American mathematician Frank Morley discovered the 
following: A theorem whose aesthetics recalls the elegant proofs in Euclid’s 
Elements — despite the fact that the theorem is not fundamentally geometri- 
cal, as we shall soon see. 


Morley’s theorem: If the interior angles of an arbitrary triangle are trisected, 
the intersections of the trisecting straight lines that are adjacent to the triangle 
sides form an equilateral triangle. 


*http://en. wikipedia. org/wiki/Time_zones_of_China 
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Proof: The proof that follows is, in essence, due to D.J. Newman (1996).? We put 
the cart before the horse, so to speak, and begin with the result — an equilateral 
triangle XY Z with side length 1. Then, we draw angles u, v, and w (as shown in 


dn 
the lower figure), each twice. We require that u+u+w = —. The resulting auxiliary 


triangles should not overlap. If we can show that s = s* and t = t*, then we are 
almost done, because by “relabeling”, we obtain equality for the remaining angles. 
First, we show that s+t=s*+t*. 


Fig. 4.88 Morley’s theorem 


20 


This results in s=a7-u-w, t=n7-u-v => s+t= 


2 
and s*+t* =7-|27 . (v+w))= a 


Moreover, by applying the Law of Sines in the triangles X 7B and XYC, we obtain 


1 d 1 e sins  € 


| . . = . ? 
sins sinu sint sinu sint d 
and by applying the Law of Sines in BC'X, we get 


d e sins* e 


> es 
sint* sin s* sint* d 
We may now conclude (because the sine function is monotonic, see p. 240) that 


sins* sins 


stt=s*4+t" a >s=s* a t=t. 


sint* sint 
© ddd 


3 Another proof by G. Schallenkamp — without the use of the law of sines — can be found on 
http: //mathoid.de/data/documents/Morley-Theorem. pdf. 
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>>> Application: The angle of circumference theorem helps 

to find a famous point in triangle geometry (Fig. 4.89) 
Where is the point inside a triangle for which the sum of the distances to the 
vertices is minimal? 


x+y+2z— minimum u 


Fig. 4.89 The Fermat point, ascribed to both Fermat and Steiner, is sometimes 
also referred to as Fermat—Torricelli point. 


Solution: 
Let us now imagine that we have drilled three tiny holes into a disk, through 
which we have pulled three strings of equal length that are tied together at 
the end G. (The triangle that is formed by the holes must not have an angle 
greater than 120°.) To the other ends of the strings, we attach three equal 
weights. The weights will immediately stretch the strings to form angles of 
120°: The three equal forces are then in equilibrium. In sum, the system 
tries to place the weights at the lowest possible location. With a given string 
length d and the distances x, y, and z of the point G from the triangle points 
A, B, and C, the weights hang at depths d- xz, d-—y, and d- z. The sum 
of these depths is, therefore, 3d - (a2 +y+z). If the weights have to hang as 
low as possible, then the expression «+ y+z must be a minimum. It is also 
possible to define G as the point where the sum of distances to the triangle 
vertices is minimal. 
The point G can be constructed by intersecting the two circles or by intersec- 
ting at least two circles of circumference, or much more simply, by extending 
every triangle side outwards into equilateral triangles and connect the new 
points X, Y, and Z with the opposite triangle points A, B, and C: The cir- 
cumcircle of the triangle ABZ is the circle of circumference of the segment 
AB, from whose points the segment on the side of Z is visible at 60°, and the 
segment on the side G at the supplementary angle of 180° — 60° = 120°. In 
this special case, it should be obvious that the three lines actually intersect 
at a point: If a point G is found as the intersection of two lines, then two 
triangle sides can be seen under 120° from G — and the third triangle side 
must automatically be 120°, as the sum of these angles must add up to 360°. 
ddd 


5 Vector analysis 


The importance of vector analysis has risen significantly through the advent 
of computers. Through their use, complex calculations in spatial geometry 
(intersection, measurement, rotation, reflection, etc.) and physics (calculati- 
ons involving the parallelogram of force, center of mass, momentum of inertia, 
etc.) are quickly and easily accomplished. 

In the following chapter, we will mostly work in three-dimensional Euclidean 
space R?. The formulae we will discuss can, if properly applied, also be used 
for calculations in the plane R?: In such cases, the third coordinate of points 
or the third component of vectors is simply set equal to 0. By analogy, vector 
analysis can be applied quite elegantly for calculations in higher dimensi- 
ons. Straight lines in the drawing plane correspond to the z-parallel planes 
(“projecting” in the top view) — and not to straight spatial lines! 

Interesting applications can be found even for techniques as simple as vector 
addition and scaling. With two further types of vector multiplication, one can 
also gain command over the calculation of angles, surface areas, and volumes. 
The wide gamut of applications ranges from problems of analytic geometry 
to physics, including the calculation of the position of the sun. 


© Springer International Publishing AG 2017 
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5.1 Elementary vector operations 


Position vectors 

Let us imagine a three-dimensional Cartesian coordinate system (Fig. 5.1) 
consisting of three mutually orthogonal axes x, y, and z. Each point P in 
space possesses unique coordinates P(pz, py, pz). The vector p = OP emana- 
ting from the origin O(0,0,0) pointing to the point P is called the position 
vector of P, and we write 


P=| Py or sometimes (pz, Py, Dz) (5.1) 


— the second version saves space. Instead of point coordinates, one speaks of 
vector components. 


Fig. 5.1 coordinate system and position vector Fig. 5.2 vector addition 


Length of a vector 

According to the Pythagorean theorem, the following holds for the distance 
of the point P to the origin of the coordinate system, and thus, for the length 
of its position vector 7p: 


Px 
Pl = || Py || = \/ pe + pe t+ p?- (5.2) 
Pz 


Vector addition 
Let D(pr,Py,Pz) and G(dz,dy,dz) be two vectors (Fig. 5.2). Their sum 


Pa t Ux 
pt+q= Py * dy | = (Dx + Ux, Py + dy, Pz + I) (5.3) 
Pzt dz 
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is a vector pointing towards the vertex opposite to O of the parallelogram 
defined by p and g (Fig. 5.2). 


Translation vectors 


If p is interpreted as a position vector of the point P and gq as a translation 
vector, then f+ is the position vector of the translated point P’. In physics, 
vector addition is applied in the “addition of forces” (Application p. 177). 
“Dualism” of a vector: 

If a vector is interpreted as a position vector, it describes a point, and if it is 
interpreted as a translation vector, it describes a translation. This property 
— to which one, admittedly, needs to get accustomed — is the secret behind 
the versatility of vector mathematics. 


® Remark: In physics, the word dualism is used to describe the fact, that radiation 
may be simultaneously described as a wave or as a stream of particles. The dualism 
of light waves<+light quanta is necessary to explain certain properties of light. ® 


>>> Application: parallelogram of forces (Fig. 5.3) 


Two or more forces are in equilibrium if their 
sum is the zero vector. The forces themselves 
do not have to lie in the plane as in the figure 
— their sum must merely be zero. 

Three forces F|, Fb, and F3 are given in the 
plane by their lengths |F;| and polar angles 
a;. The task is to calculate the resulting for- 
ce Fr, which keeps these three forces in an 
Fig. 5.3 “Trigo” system equilibrium (“Trigo”)! 


Solution: 
The resulting force is given by Fr = —(F, + F) + F3). In the plane, we may 


also write 
a |F;| cos a; =  (-d|Filcosa; 
R= 5 ; => FR= ~¥|Fi|sin a; A 


® Remark: Three equally long vectors in the plane which mutually enclose an angle 
of 120° sum up to the zero vector. From this, Formula (4.17) from Application p. 134 
immediately follows. @ <<< 


>>> Application: Leonardo's centrifugal force experiment 

Leonardo da Vinci made spheres rotate on chains as depicted in Fig. 5.4 (the 
central sketch is his). Given a constant angular velocity, the spheres will move 
on fixed circular paths. The task is to determine the angle of inclination of 
the chains. 
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Fig. 5.4 Leonardo’s centrifugal force experiment 


Solution: 

Each sphere experiences its mass G' and the centrifugal force F' (proportional 
to the distance from the axis of rotation and to the square of the angular 
velocity). The resulting force R influences the inclination of the chain (the 
weight of the chain is not considered). 

Numerical example: The distance of the chain connection point is 10cm. The length 
of the chain (towards the sphere’s center) is 14cm. Which angular velocity produces 
a chain angle of 45°? 


Gg 10 m/s? / 10m 7 
F=G 2 V2 x x, 
eee ee r (10 + 14 cos 45°) cm 0.2ms? 5s 


The spheres must, therefore, rotate at a rate slightly faster than one rotation about 
2a 


the central axis per second (with w = =, it would be exactly once per second). The 


result is obviously independent of the spheres’ masses (see Application p. 138). 


® Remark: If one increases the angular velocity, the matter is more complicated. 
Due to the sphere’s inertia, they will “lag behind”, and thus, they will experience 
greater acceleration than the rest of the system. This causes a short-term excessive 
centrifugal force, which “raises” the spheres a bit too far. If the additional acce- 
leration ends, the system returns to equilibrium. Such unstable configurations are 
seen in the computer simulations in Fig. 5.4. With constant angular velocity, the 
imaginary extensions of the chains meet the axis of rotation in a fixed point! © 

ddd 


>>» Application: Why does an airplane fly? (Fig. 5.5) 


Countervortices. 


— Increases air velocity 


Decreases air velocity Vortices around the edge 


Fig. 5.5 A non-trivial explanation of how lifting forces make an aircraft fly. 


Solution: 

The airfoil (usually almost symmetrical in the case of faster aircraft and 
curved upward in the case of slower ones) has a sharp edge in its rear. When 
“pitching” upwind, there arises a counter-clockwise velocity that is strongly 
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dependent on the speed v. (Aircraft with symmetrical profiles have extendible 
flaps which enhance the formation of vortices used for slow flight phases.) 


Fig. 5.6 wing motions of a butterfly (average frequency) 


According to the law of conservation of angular momentum, a vortex arises, 
rotating clockwise around the profile (speed |Av]). If the angle of approach 
is too large, the flow breaks off and the aircraft descends uncontrollably. At 
the opposite vortex, there are velocity vectors Av (by definition directed and 
oriented) assigned to each point. The sum vector v+Av is, therefore, greater 
at the top than at the bottom. 

Due to the aerodynamic paradox, negative pressure acts on the side where 
the air flows at greater speed, and this will generate a buoyant force that 
causes the aircraft to stay in the air. 


Fig. 5.7 fin motions at relatively low frequencies 


@ Remark: A simplified explanation that is occasionally found reads as follows: The 
path of the air at the top is longer than the path at the bottom, so that the air at 
the top must flow at a greater velocity. This explanation is too simple. 

Flying animals — like butterflies — produce the necessary vortex by synchronously 
twisting their wings (Fig. 5.6). At a very high wing-beat frequency, air behaves like 
a denser medium, and flying insects can — similar to how water animals work with 
water resistance (Fig. 5.7) — repel compressed air to create cushions. Aircrafts glide 
at high speed on air cushions. ® <<< 


>>» Application: increasing twisting from inside to outside (Fig. 5.8) 
Photographs show that towards the tip, the leading edge of a bird’s wing is 
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increasingly pulled downwards (pronated). A similar twisting can be found 
in an aircraft propeller. What is the impact of this wing twisting? 


Fig. 5.8 Wings with increasing twisting from inside to outside: dgign, is constant, 
Uap increases from inside to outside. We have Wattack + Upight + Uap = 0, and Dattack 
changes direction (and length) along the wing. 


Solution: 

A wing section can only produce beneficial lift and thrust if the resulting ae- 
rodynamic force is inclined forwards. This requires a favorable angle of attack 
a that is not too large, as may be the case with the proximal portion of the 
wing. If the wing were not twisted in this way, the distal portion of the wing 
would be hit by the flow at too large an angle of a2. Then, the airflow would 
collapse and this section of the wing would fail to produce any aerodynamic 
force. This is because, due to the constant angular velocity of the flapping 
wings, the absolute flapping velocity Uap rises as the distance from the wing 
basis increases, while the flight velocity tignt remains steady (see the two 
velocity triangles). This negative effect can completely be compensated by 
means of pronating wing twisting. The favourably small angle a; is, thus, 
retained across the whole wing span. <<< 


Vector subtraction, direction vectors 


Again, let p and g be the position vectors of 
two points P and Q (Fig. 4.8). The direction 
> 


vector PQ is given by the difference vector 


aoe dx — Px 
PQ =4G-B=| qd - Py (5.4) 
dz — Pz 


Fig. 5.9 vector subtraction 


(“tip minus shaft”). 
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>>> Application: distance between two points 
The distance PQ of two points P and Q equals the length of the difference 


vector PQ = |Po| = |g - pl. << 


Scaling a vector 


The product of the vector by a real number \ (a “scalar”) is defined as the 
scaling of all components: 


A Da 
Ap = PpA=|Apy |. (5.5) 
\Dz 
In particular, the opposite vector —j of p is obtained by setting A = -1. 


Fig. 5.10 simple and multiple pulleys (according to Leonardo) 


>>> Application: pulleys (Fig. 5.10) 

For a simple pulley (incidentally invented by Archimedes in the 3rd century 
BC) where an object of weight fal hangs and moves by means of a movable 
roller on a rope, the pulley is fixed by being mounted to a beam on one side 
and the rope is deflected on the other side via a guide pulley. It is sufficient 
to pull with a force F> ep Find the reason for this and also explain the 
multiple hoists in Fig. 5.10, center and right. 


Solution: 

During lifting, kinetic energy is converted into potential energy. We have 
Work = Force x Distance = Weight x Travel Distance. Pulling the deflected 
rope by a length s with the force F, one performs an amount s- F of work. 
Since the roll above the object is movable and will occupy the lowest position 
at every moment, the change in the rope length on both sides of the roll will 
be distributed, and the travel distance is only 58. Thus, the equilibrium 
condition reads 


> 


i > > > 
ar es G. 
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If F > 4G, then the object is pulled. 

With each additional pulley, this effect occurs again: Only half the power is 
required allowing the load to be lifted halfway. Consequently, with n pulleys, 
Ps JG provides enough force, and the load is lifted by oa s. Thus, with 
three guide pulleys shown in the middle of Fig. 5.10, only one eighth of the 
weight is needed in order to keep the load in equilibrium. 

The pulley on the right (after a drawing by Leonardo da Vinci) makes it 
possible to lift the 13-fold load: By pulling the right rope by a distance s, 
this segment is split up equally among the 13 sections. <a 


If we interpret the vector Ap as a position vector, it is directed towards the 
point that is obtained from P by dilating it from the origin of the coordinate 
system by a factor X. Interpreted geometrically, multiplication of a vector by 
a real number describes a central dilation. 

In physics, this notation allows us to compute centers of gravity elegantly: 


>>> Application: the center of gravity S of a segment AB 

Let us assume that a segment has “dead weight” if the mass is distributed 
homogeneously. Then, the center of gravity and the midpoint of the segment 
AB coincide: 


je 5(a +). (5.6) 


Therefore, the center of gravity divides the segment in the ratio 1:1.  <«« 


Fig. 5.11 balance Fig. 5.12 center of gravity of a triangle 


>>» Application: center of gravity S of a triangle ABC (Fig. 5.12) 

The geometric construction of the center of gravity of a triangle (centroid) is 
just the intersection of the medians of the triangle. The following important 
theorem holds: 


The center of gravity of a triangle is obtained by the formula 


$= 3(a+b+0). (5.7) 


The centroid of a triangle divides the medians in the ratio of 1: 2. 
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Proof: Let Ha = (b+ 2) and H, = $(a +b) be the midpoints of BC and AB. Then, 
we use the ansatz 


S=C+u-CH,=a+v:-AH, 
: 1 ae 1 oe. a 
= é+u-(5(4+d) é)=4 u(5(o+2) a) 
> 5a 5b (l-u)é=(1-v)a+ 5b ao 
A comparison of the scalars of b shows $= 3 > u=v. 


The scalars of a result in 5 =1-v=l-u>u=v - These solutions for wu and 
v also induce the equality of the scalars of é. Thereby, it is already proven that the 
medians are divided by the centroid in the ratio of 1: 2. If we now set u =v = 2 in 


the first equation, we obtain the desired formula (5.7). © <<< 


The center of gravity of a general planar n-gon is not ees by simply 
adding the position vectors and multiplying the sum vector by = 4+! One must 
instead disassemble the polygon into triangles (“triangulate”), calculate the 
individual centers of gravity and their areas, and combine them. How this is 
done is shown in Formula (5.16). In Application p. 218, we will calculate the 
center of gravity of a general quadrangle. 

In the special case of a regular polygon with an even number of vertices (Fig. 
5.13), it is obvious what the center of gravity is. The same applies to all 
polygons, resulting from the above through stretching or compression (that 
is, by an “affine mapping”), in particular this holds for all parallelograms. 


® Remark: If you draw a triangle in which a straight line cuts the triangle into two 
equal parts (in general, a triangle and a quadrangle), these generalized focal lines 
do not go through the center, but envelope a curve. See Application p. 346). ® 


>>» Application: the centroid S of a tetrahedron ABCD (Fig. 5.14): 

By no means must a tetrahedron be regular. The geometric center of gravity 
is the intersection of the “medial planes”. In terms of vectors, the centroid 
equals 


1 7 . 
5= 7(G+b+é4d). (5.8) 
The center of gravity divides the “medial planes” in the ratio of 1: 3. 


D 


c 
B 


Fig. 5.13 center of gravity of a hexagon... Fig. 5.14... and a tetrahedron 
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The proof is derived analogously to the one above by choosing the centers of 
gravity of two triangles instead of midpoints H, and H,. This yields u = v = 3. 
which shows the specified division of the medial lines. 

® Remark: The calculated centroid is the center of gravity of the four vertices and 
at the same time the center of gravity of the tetrahedron considered as a full solid. 
It is different from the center of gravity of the six edges and also different from the 


center of gravity of the four triangular faces. @ <<< 


Again, the formula applies only to a tetrahedron, but not to general polyhedra 
with n > 4 vertices. The point obtained by computing the “average” of the 
position vectors d; of the vertices A; 


> ij (5.9) 


is the center of gravity of the n vertices A;. It is also the center of n spheres 
of equal weight centered at the points A;. 

In addition, there is the center of gravity of the edges, which occurs when the 
polyhedron is considered only as a wireframe. In the following, however, we 
are most interested in the center of gravity of full solids. 

For centrally symmetric polyhedra, it is intuitively clear that the different 
kinds of centers of gravity coincide. In particular, this applies to regular 
prisms and cubes and to affine copies of them (where the edges are not 
perpendicular to the base surface). 


Parametric representation of a straight line 


An arbitrary point X in space (Fig. 5.9) with the position vector % on the 
line PQ can now be described by the vector equation 


%=p+APQ (Areal). (5.10) 


We give two examples: 


>>> Application: dividing a segment 

The points P and Q correspond to the values \ = 0 and 4 = 1. The points 
between P and @ are described by 0 < A < 1. In particular, the center M 
corresponds to the value 


This results in a. dee, 
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>>> Application: reflection of a point Q in a point P 
For the reflected point Q*, we must set A = -1: 


[* = B- (G-B) = B-4. (5.12) 


An important physical application of the parametric representation of a 
straight line is again the calculation of the center of gravity: 


>>> Application: center of gravity of a complex solid with homogeneous 
mass distribution (Fig. 5.15) 

We assume that a solid is composed of two simple building blocks with masses 

My, and M2 and respective centers of gravity S; and S2. Then, S is the overall 

center of gravity that lies on the line $15» 


S= 51 + AS) So. (5.13) 
According to the lever principle, for the parameter A, we have 
Ms, 
AM, =(1-A) M- A= : 5.14 
1=(1-A)M, = ree (5.14) 


Together with equation (5.13), this gives 


1 
co Mis, + Mos). zl 
8 TAeaY (M151 282) (5.15) 


When dealing with homogeneous solids, we can compute with volumes in- 
stead of masses. This process can be repeated if there are more than two 
partial solids, where we apply the same technique: 


1 
~M; 


Misi. (5.16) 


— 


>>> Application: the common center of gravity of the Earth and the Moon 
(Fig. 5.16) 

According to Formula (5.15) and with 4 = 81Mo, we have § = 5 (815) +8). 
Because d x 384, 000 km, the common center of gravity, thus, lies at a distance 
& x 4,700km from the center of the Earth, that is still within the Earth 
(Earth’s radius, r » 6,370km). The Earth and the Moon rotate around this 
common center of gravity, that is, the Earth “wobbles”. 

® Remark: Analogously, the Sun “wobbles” around the common center of our plane- 
tary system. We now know that there are other suns (=“stars”) which also “wobble”. 
With the fact that the double planet Earth-Moon rotates around the common center 
of gravity S', one can also explain why there are high tides and low tides twice a day 
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Fig. 5.16 common center of gravity of the Earth 
Fig. 5.15 multiple blocks and the Moon, two tidal bulges 


(Fig. 5.16): The first tidal bulge is at the point F that is closest to the Moon. This 
point on Earth is affected the most by lunar attraction, while it is only minimally 
affected by the centrifugal force during the rotation around S. The second tidal 
bulge occurs at the exact opposite point F) — we get the maximum centrifugal force 
due to the rotation around S because it is so far away from this and more than 6 
times as far as the Moon’s next point. The two tide mountains move due to the 
rotation of the Earth over the course of almost 25 hours (the Earth has to “overturn” 
in order to “catch” the next migrated moon, Fig. 8.56). ® <<< 


>>> Application: center of gravity of a regular pyramid 
or a cone of revolution 


Fig. 5.17 center of gravity of a pyramid Fig. 5.18 truncated pyramid 


Let P,, P2,...,P, be the vertices of the base and let T be the tip of a regular 
n-sided pyramid with height h (Fig. 5.17). (For very large n, the pyramid is 
close to a cone of revolution.) The pyramid can now be subdivided into n 
tetrahedra M P;P;,:7T. Their volume centers of gravity S; are the arithmetic 
means of the four vertices according to formula (5.8). The centers S; form a 
regular n-gon at height A above the base plane. Since all sub-tetrahedra have 
the same mass, the overall (volume) center of gravity coincides with the center 
of the regular n-gon, and thus, we have (indeed, also for the tetrahedron) 


oe ak, wpe, ce (one 
ar m) = 7 (3m +t). aa 


Formula (5.15) also works when we “subtract” one solid from a second solid. 
In this case, the extracted mass is negative. 
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>>> Application: volume center of gravity of a regular truncated pyramid 
A truncated pyramid (frustrum) is obtained when we cut off a (scaled but 
smaller version) I of the initial pyramid Il, (Fig. 5.18). 

The pyramid I, has height h;, mass M;, and volume center of gravity 51 
with S)T = 3h. The pyramid Ilz has height hg = kh,. Then, its mass is 
M2 = k? M,. For its center of gravity So, we get SoT = ho = 3E hy, and thus, 


— 1 a a 
fs (BP - 1 ST) S 
=S Mn! 151 2 SoT’) 
1 3 8k, a 3(1-k*) 
= M 3M, — hy. 
rary a oe 4(1—k3) "+ 


Special cases occur for k = 5 i= ST x 0.80h; andk=2>ST#0.89h;. «<< 


>>> Application: Stretch those legs backwards! (Fig. 5.19) 

Flamingos fly with extended necks and legs. Their center of mass is located 
in the middle of the trunk, around the sternum. The bird can thus rotate 
about its lateral axis. If the depicted flamingos did not stretch out their legs, 
they would rotate in a counter-clockwise direction (G. Glaeser, H.F. Paulus, 
W. Nachtigall: The Evolution of Flight. Springer Nature, Heidelberg 2017). 


Fig. 5.19 Torque compensation avoids tilting 


In order to eliminate these tilting tendencies, flamingos must move their 
wings in a complex manner, which would require additional metabolic energy. 
However, torque compensation allows them to focus their flight performance 
on moving forward and generating lift. Their flight muscles are relatively 
weak in comparison to those of other birds. Yet it is possibly due to their 
balanced flight that flamingos are still capable of flying distances of up to 
500 km. <<< 
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5.2 Dot product and cross product 


Vectors can be multiplied in two different ways. The dot product results in 
a real number. The cross product results in a vector. Both products play a 
central role in vector analysis. In particular, they are used for the determi- 
nation of lengths, angles, areas, and volumes. Further, the intersection of 
lines/planes, distances between points/lines/planes, and reflections can be 
computed. 


Dot product 


Let G(az,ay,az) and 7i(nz,ny,nz) be two vectors. Then their dot product is 
defined by 


Na Ay 
HG = Get =| ny] >| @y | = Ne Gz + Ny Gy + Nz Az. (5.17) 
Nz az 


By definition, two vectors @ and 7 (neither being zero) are perpendicular to 
each other if their dot product vanishes: 


Ala © a n=O. (5.18) 


The latter equation is actually a special case of Formula (5.36) and is verified 
there. 

For a vector G@ in the plane (two-dimensional), all possible normal vectors can 
be immediately found by 


a= (**] => n=a(-*). (5.19) 
ay Agr 


Using vector analysis, we can elegantly prove the following fundamental theo- 
rem of Descriptive Geometry dealing with the images of right angles: 


Theorem of the right angle: In a normal projection, a right angle appears as 
a right angle if at least one of the two legs is perpendicular to the projection 
direction (parallel to the plane of projection), and neither is projecting. 


Proof: We choose a coordinate system such that the image plane coincides with 
the xy-plane. The two legs of the right angle are described by the vectors @ and 
7, where Formula (5.17) is valid because @ 1 7%. In the normal projection onto the 
xy-plane, the z-component simply disappears. If the projections of a@ and 7 again 
form a right angle, then we have az nz + dy ny = 0. If we subtract Formula (5.17) 
from the latter equation, we obtain a,n, = 0. This is only satisfied if either a, = 0 
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5.2 Dot product and cross product 


1 
Fig. 5.20 Application of the theorem of right angles: A cuboid is displayed in 
different ways in a normal projection so that the drawn right angle can be seen 
“undistorted”, i.e. as a right angle. The direction of the projection has to be per- 
pendicular to the edge 12, which can be done in an infinite number of ways. 


and/or nz = 0, that is, if at least one of the two vectors is horizontal (i.e. parallel to 


the image plane). © 

>>> Application: When exactly do spring and the other seasons begin? 
The beginnings of spring and fall are characterized by the fact that day and 
night have the same duration (equinor). Summer and winter begin when 
the nights are longest or shortest respectively. How can we find the exact 


Oe : . 
. os equinoctium 


plan view onto plane 
of the ecliptic 


moment? 
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Fig. 5.21 The theorem of right angles helps to find the beginnings of the seasons. 
Left: generic normal projection, right: plan view. (The shape of the elliptic orbit is 
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exaggerated to show the construction more clearly.) 


Solution: 
Due to Kepler’s law, the orbit of the Earth (ecliptic) is an ellipse with the 
Sun in one focus (Fig. 5.21). The axis direction of the Earth stays constant 


during the year. If we consider a plan view onto the plane of the ecliptic, the 
Sun rays that reach the Earth are always principal lines. 
The equinox happens when the terminator of the Earth goes through the 
poles. At this moment, the Earth axis and the sun rays are perpendicular. 
Thus, the desired moment can be determined accurately by means of the 
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theorem of right angles: The plan views of the corresponding Earth positions 
lie on the normal to the plan view of the axis direction. 

Perpendicular to this direction, we immediately get the positions where the 
angle ao between the sun rays and the axis direction is extreme — this leads 
to the longest days and nights. 

Proof: To prove this, we again need the theorem of right angles (Fig. 5.22): Let E 
be the position of the Earth where the plan view of the axis direction runs through 
the sun S. o is determined by a right triangle EAC, where A is a fixed point on 
the axis (FA is constant) and C is its foot on s. 


E* 
a it canteen - 
4 
Ss E i a oct — Ss 
c om. ee 


E** Re - fe 
ste ———_ A** 


« 


Fig. 5.22 the extreme angle between the Sun rays and the Earth axis 


In the given position, the triangle appears as an edge. In any other neighboring 
position E* or E**, the foot C can be constructed as the planar foot of A, since 
s is a principal line. This means AC’ grows longer, and thus, 7 becomes smaller in 
both directions, so that o has its maximum in E. ©) daa 


Cross product 


If 7% is normal to two vectors @ and b, its components have to satisfy additio- 
nally the condition nz by + ny by + nz bz = 0. The so-called cross product 


7 Ar by dy bz — Az by 
nw=Gxb=|ay|x| by] =| az bz — az bz (5.20) 
az Ds Ay by — dy by 


gives a vector that fulfils both conditions and is, therefore, normal to both a 
and b: 


n=axb => G@-i=b-A=0. (5.21) 


Proof: We check the condition for the vector @ (analogously for b) by calculation. 
We have 


Gy bz — az by Ax: 
TheG =| az by — Gz bz | + | dy | = (ay b--az by )az+(az bz—-Gz bz )dyt (Az by—ay by )az = 0. 
Ay by — Ay by Az 


Note that the order of the vectors is important: 


Gx b=-bx a. (5.22) 
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If one likes to compute (2,2)-determinants, Formula (5.20) can also be re- 
written as follows: 


dy by 
az ob, 
axb= -| % ‘a (5.23) 
fea 
dy by 


>>> Application: buildings with a pitched roof (Fig. 5.23) 
Compute the normal vector of the roof surface PQR and check the result. 


Solution: 
From the drawing in Fig. 5.23, we read the 
coordinates of the points P, Q, R: P(0/5/5 5/5), 


Q(-5/5/3), R(0/0/5). We get i = PQx PR = 


-5\ (0 0-0—(-2)-(-5)\ -/-10 
-| 0 |}x{-5] =] (-2)-0-(-5)-0] =| 0 
=) AU (-5)-(-5) -0-0 25 


Fig. 5.23 pitched roof 


Check: 71 PQ = (-10)-(-5)+0-0+25-(-2) = 0, 
fie PR = (-10)-0+0-(-5) +25-0=0. 


® Remark: For many applications, it is important that the normal vector is oriented 
outwards. If necessary, the vector can be “reversed”. © <<< 


Implicit equation of a plane 


In general, three points P, Q, R determine a plane (Fig. 5.23) with a uniquely 
determined normal vector 


i = PQ x PR. (5.24) 
For any other point X (position vector Z) in the plane, the following holds: 


—=> —=> 


PX in => PX-.n=0. 
With PX =%-, we have 
ne (Z-p)=0>n-L=n-p 


where 7-p = cis constant. The equation of the plane can, therefore, be written 
as follows: 


NeL=C. (5.25) 
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>>> Application: equation of a plane 
What is the equation of the plane « = PQR from Fig. 5.23? 


Solution: 
With the results from Application p. 191, we get 


-10 -10\ /0 
e: | 0 |.#=] 0 ]-[o] = 125. (5.26) 
25 25) \5 


We have hereby inserted the point R. We could also have used P or Q to 
obtain the same result: 


—10 0 —10 —5 
O J-]5]}=] O J-] 5 | =125. 
25 5 25 3 


® Remark: The constant c = 7-p = eG = n-F still depends on the length of the 
normal vector and this, in turn, on the points P, Q, and R. Instead of 


-10 -2 
O |-% = 125, we can write | 0 | -% = 25. Notated differently, the equation is, 
25 5 

then, -24+52z=25.@ <<< 


>>> Application: plane of symmetry between two points 

Solution: 

The plane of symmetry (bisector) o can be interpreted as the locus of points 
in space that have the same distance to P and Q. 

The normal vector r of o between P and Q is the vector nz = PQ =q-p. 
As a point of 0, we choose the center m = $(G+ 3) of PQ. Thus, o has the 
equation 1, -% = 11, +7 = (G- B)-5(G+ B) or 


o: (G-9)-B=3(7?-P?). (5.27) 


® Remark: In R?, equation (5.25) can be interpreted as a parameter-free equation 
of a line. Formula (5.27) is the equation of the bisector of the segment PQ.® <<< 
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5.3 Intersecting lines and planes 


In space, the following three intersection tasks often occur: 


Intersection of a line and a plane 
Let a line g = PQ be given by its parametric representation Z = p+ \POQ, 
and let a plane € have the (implicit) equation m-% = c. Then, the point of 


intersection S' = gne has to satisfy the condition 7-(p+A PQ) =c. Therefore, 
the parameter value of the point S equals 


C—lep 


5 eal 
ii»PQ 


(5.28) 


For riePQ = 0 (g || ce or gc €), Formula (5.28) fails, because there is either 
no intersection or all points of g are contained in ¢. 


Fig. 5.24 intersection of plane and line ‘Fig. 5.25 intersection of three planes 


>>» Application: shadow of a point (Fig. 5.23) 
Determine the shadow T; of the point T(-5/5/6) sketched in Fig. 5.23 on 
the roof plane PQR with the light direction (1, -2,-3). 


Solution: 
We have to intersect the straight line s through T' with the direction vector 
8 with the roof plane in order to get T). 


-5 1 
® Remark: The light ray § reads % = | Sy) | +X (} Inserted into Formula (5.26), 
6 -3 


-1 
we get | 0 | *@ = 125 and 


25 
-10 -5 1 
O Jel] 5 J+A] -2]] = 125 
25 6 -3 


200 - 85A = 125 > A= c x 0.882, 


which leads to 
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and therefore, 


—5 1 -4,12 
t,=] 5 1+0.882]-2]=] 3.24 
6 -3 3.35 
® dq 


>>> Application: anamorphoses 

Consider Fig. 5.26, right: A (fictional) hemisphere is projected from an eye 
point F (top view E’) onto the ground plane ¥. For this, the lines of sight EP 
are to be intersected with ¥y in order to get P° = EPnv. From the position E, 
we can — with one eye — see no difference between the point in space and the 
image point (Fig. 5.26, left). Thus, one can easily produce optical illusions 
which are called anamorphoses. 


Fig. 5.26 From the correct position, the characters merge with the projection. 


Numerical example: The eye point E(0/0/a) shall lie on the z-axis at the height a. 
P(pz/py/r) is a point at the height of the sphere’s radius r. The line of sight EP has 
the equation Z = (0,0,a)+A(pz,py,r—a). For the intersection P° with y (z = 0), this 
results in the constant a+ A(r-a)=0=A=a/(a-1). Therefore, P°(A pz/Apy/0). 
Since the affine ratio (EPP°) is constant, we see figures of constant size, as shown 


in Fig. 5.26, established right over the points P°, as well as figures of constant size 
on the spherical boundary. <<< 


Intersection of three non-parallel planes 
Let 1-% = 1, N2-% = cg, 13+% = cz be the equations of three planes (Fig. 
5.25). Then, the coordinates of the intersection point S(sz, sy, 5z) is the so- 


lution of a system of three linear equations with three unknowns (Formu- 
la (5.29)). This can be solved by Cramer’s rule. 


N1gSx tT NMySy tT N1z5z =C1, 


NogSe t+ NoySy+N2282 = C2, (5.29) 


N3e5a 7 NZySy 7 N3z8z = C3. 


If we are not able to solve the above system, then two planes are either 
parallel or identical. The main application of this is to solve the following 
tasks: 
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Intersection of two non-parallel planes 


The line g of intersection of the two planes 
nyk =C1, Nz 6% =C2 


has an equation of the form % = §+A7. The direction vector 7 is perpendicular 
to both of the planes’ normals 


f= mT x no. (5.30) 


Now, we need a point S(sz,5y,5z) on g. In general, 7 is not parallel to the 
xy-plane, and so, r, # 0. Then, S can be found in the plane z = 0: We intersect 
the three planes 

Ny @ = Ci, Nz 9k = (2, z=0 


and obtain the (3,3)-system 


N1gSx + NySy tM128z = Cl, 
NQ¢8x + NQySy TN2zSz = C2, 
8, = QO, 


which is really just a (2,2)-system 


NMaSx + N1ySy = Cl, 


N228¢ + Nyy = C2, (5.31) 


and can be solved using Cramer’s rule. In the case r, = 0, one can use the 
plane x = 0 for rz; + 0. Otherwise, the plane y = 0 is used. 


Fig. 5.27 generic rotation Fig. 5.28 center of rotation 


>>> Application: generic rotation in space (Fig. 5.27) 

A line segment PQ is to be moved from one position P,Q, into another 
position P,Q by rotation. The axis of rotation is found as the intersection 
of the planes of symmetry of P; P and Q1Q2 using Formula (5.27). A detailed 
calculation with specific values can be found in Application p. 228. 

® Remark: In general, two congruent triangles cannot be transformed into each other 
by rotation. This requires a helical motion (the main theorem of spatial kinematics, 
see p. 309). ® <a 
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In R?, there is only one task related to intersections, namely the intersection 
of two non-parallel lines. Let ny -% = c, and n2-% = co (with two-dimensional 
vectors!) be the equations of the two lines. Then, the coordinates of the 
intersection point S(sz,5,) satisfy the system of linear equations (5.31). 


>>» Application: center of a rotation (Fig. 5.28) 

Find the center (“pole”) of the rotation that transforms the triangle A,B,C 
(system ©) into the triangle Az BjC 2 (system ¥2). 

Solution: 

P is the intersection of the bisectors of A, Az and B, By (again found with 
Formula (5.27)). 


Numerical example: 
Let A;(-3/0), Bi (0/0), A2(1/0), Bo(1/-3) be given. Then, we have the normal 
1 
-3 


» QealO-Cl 


vectors A; A» = (;) , By Bo= ( ) The bisectors (Formula (5.27)) are 


The (2, 2)-system has the form 


4-5,+0-8, = 4, 


1-5, -3> Sy (5.32) 


| 
On 


and, therefore, the coordinates of the center of rotation are (-1,-2). 

The computation of the rotation angle y can be done with Formula (5.36). 

The fact that two directly congruent triangles in a plane can be transfor- 
med into each other by a rotation is of fundamental importance for planar 
kinematics (see p. 299 and the following). <<< 


>>> Application: the circumcircle of a triangle 
Determine a point which is equidistant to all the three vertices of a triangle. 


Solution: 

Let ABC be a triangle and let U be the circumcenter. Because U is equidi- 
stant to all vertices A, B, C, it has to lie on the bisectors of AB and BC (or 
AC). The radius r of the circumcircle is equal to the length UA. 


Numerical example: 
For A(-3/0), B(2/-4), C(1/5), we obtain U(0.8/- 0.8) and r = 3.88330. << 
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5.4 Distances, angles, areas, volumes 


Normalizing a vector, unit vector 
According to Formula (5.2), a vector & has length |0| = \/v? + v7 + v2. Scaling 


w by the factor A = a returns the associate unit vector, i.e. the vector with 
length 1 


— 


0 = — (5.33) 
al 


parallel to @ with length 1. 
This process is called “normalization” of the vector. Unit vectors are usually 
labeled with the index 0. For each unit vector vp, we have 


09 +09 = Vy = 1. (5.34) 
1 

Numerical example: The vector 0 = | -2 | has length \/1? + (-2)? + 22 = /9 = 3. The 
2 


1 
associated unit vector is, therefore, v9 = 3 —2]. 

2 
Drawing a segment on a line 


Using normalized vectors (unit vectors), one can draw a segment of prescribed 
length on a given line. If we find, for example, a point R on the line PQ at 
a distance d from P, then this point is determined by the vector equation 


7 =p+dPQo. (5.35) 


Measuring angles 


Unit vectors are also the key to measuring angles in space. Let @ and by be 
two unit vectors enclosing the angle y. Then, we have the following important 
formula: 


Qs 
° 
ot 


cos y = do+bo = (5.36) 


bl 


Qt 


—= > —, — 
Proof: With the notation in Fig. 5.29, we have ON + NB = OB where ON = 
—. 
cos pap and NB =siny no (7 1 @). This leads to the ansatz 


cos dy + Sin y Tip = bo. (5.37) 
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a) Squaring both sides in (5.37) yields 
~2 
cos” p do” +siny” ip” +2 sinycosydy+np = bo . 
2 so ees 
1 1 1 


Together with cos? p+ sin? y = 1, we deduce the condition of “orthogonality” 
do +o = 0. 
b) Multiplying (5.37) by do gives 
cos Y ao" +siny Np +do = dig +bp > cos yp = do «bo. 


1 0 


© 


With Formula (5.36), one can calculate the angle between two lines or two 
planes, or the angle between a line and a plane. 


no! 


/ 
/ 
in 
B 


Fig. 5.29 dot product Fig. 5.30 angle of incidence and best utilization 


>>> Application: the angle of incidence (Fig. 5.30) 
What is the angle of incidence a of the light ray §(1,2,-1) measured to the 
normal of the horizontal base plane 7? The next question would be, how 
bright is the base plane, or rather, what is the associated energy input (see 
Application p. 226)? 
Solution: 
The light ray points downwards. So, we measure the angle to the negative 
Z-axis: 

St . 1 
1% = 7th: = 0 « 8h = = 2 r 


-l V6 = 


cosa = No-Sp = 0.408 > ax 66°. 


Following Lambert’s law, the brightness of the illuminated surface h is pro- 
portional to the cosine of the angle of incidence (0 < h < 1). How bright is 
the base plane? 

From the above calculation, we get 


h = 0.408 » 41%. 
® Remark: Without knowing Lambert’s law, one can guess that the fly seen on the 


right in Fig. 5.30 quickly warms up in the morning Sun when the angle of incidence 
between the wings to the light is as steep as possible. © <<< 
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>>> Application: a designer-lamp (Fig. 5.31) 

An aluminium-bracket serves as a reflector of a lamp. Compute 

1. the angle BCD (cosy = CBo+CDp), 

2. the angle ADB (cos6 = DAg» DB), and 

3. the bending angle (angle (6 enclosed by the planes ACD and BCD). Pos- 
sible additional questions: What are the lengths of the edges? What is the 
coat surface of the bracket? 

Solution: 

The coordinates of the relevant points can be read off from Fig. 5.31 and are 


A(4/0/0), B(0/3/0), C(4/3/0), D(0/0/-8). 


Then, with cosy = BC osCDo; we get the angle y » 65° and with cosd = 
DAo DEG, we find the angle 6 » 33°. 

Now, we calculate the bending angle: The angle subtended by the planes 
ACD and BCD is measured as the angle of the normalized normal vectors 
m1 and m2. From this, the supplementary angle (i.e. the difference from 180°) 
is given, because the bracket is to be bent to about 6» 81°. <<< 


Fig. 5.31 designer-lamp Fig. 5.32 point with minimum speed 


Distance of a point to a plane 


Let T be a point in space and let 79-% = c be the equation of a plane e«. Here, 
No is already normalized. Then, T has the (signed) distance from ¢ 


d=Té =1ip-¢-c. (5.38) 
Pedal point in a plane 


In particular, the pedal point F’ of T on the plane ¢ can be easily determined: 


fet-adiget= (1% +t — c) ro. (5.39) 
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>>> Application: the brightest point in a plane 

Let T be a light source. We will look for the point F of the planar roof which 
is illuminated the most (Fig. 5.23). 

Solution: 

The brightest point is the pedal point of T on the roof plane e, since the angle 
of incidence of the light rays measured to the perpendicular is minimal. <«<« 


>>> Application: rotating a straight line (Fig. 5.32) 

A straight line g rotates about a point T’ and envelopes a circle. Find the 
point F’ on g with the lowest orbital speed (i.e. the point of contact of the 
circle and the line g). 


Solution: 
Since we are in the plane, the third component of the vectors will, therefore, 
be zero. We use Formula (5.39) with mo 1 PQ (cf. Formula (5.19)). <<< 


Distance of a point to a straight line 
This problem has already been dealt with when discussing the intersecti- 
on tasks: In order to find the point F’ on a straight line g = PQ which 
has the smallest distance from a given point T (the pedal point of the nor- 
mal), one intersects the normal plane v of g through T with g (Fig. 5.33): 
g=PQ: =pt+Aro (7 ... normalized vector PQ), 
Vi: fo-£=7o-t=C, 
UNG: Foe(P+ Ao) =C> y-PtrARG =7Foet. 
ee 
1 
This yields the parameter Af of F', and consequently, the distance d = Tg 


equals 


\p=fo-(f-f), d=TF. (5.40) 


>>> Application: shortest distance 
A plane flies along a straight line PQ and passes an observer T. When is 
the observer closest to the plane and what is this distance if the points are 


P(3/0/1), Q(0/4/1), T(0/0/0). 


Solution: 


9 . [3 
Ap =foe(t Die ae O|+ 
1 
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d=|f-d=2.6. on 


Fig. 5.33 the pedal point of a straight line Fig. 5.34 a circle in space 


>>> Application: circle in space (Fig. 5.34) 

A point T rotates about a line g = PQ. Where is the center M of the circle 
and what is the radius R? 

Solution: 

The center M of the circle is the pedal point of P on g and R= MT. <<< 


Distance between two skew lines 
Two lines g = PQ and h = RS which do not lie in a plane, in general, have no 
intersection (that is, they are “skew”). It makes sense to define the length of 
the common normal of g and h as the distance d= gh. The common normal 
has the direction 7i = PQxRS (normalized no). The following vector equation 
holds (Fig. 5.35): p+ PF, + FF, =*+ RF, thus, p+ PQ + driy = 7 + uRS , 
and _ _ 

APQ + drip = PR+ wRS. (5.41) 


Fig. 5.35 common normal Fig. 5.36 approximations of a 1-sheeted hyperboloid 


We take the dot product of the equation with no. Because no 1 PQ => 
rio» PQ = 0, rio L RS => rio RS = 0, and np? = 1, we find 


d= tig» PR. (5.42) 
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If we define d as the absolute value of this dot product, then d is always 
positive. 

The parameters A and pu of the pedal points Fy and Fy}, are obtained by taking 
the dot product of the equation (5.41) with the vectors mp, = RS x mp and 
Ng = PQ x No: 


in» PR t,-PR 
ee ge (5.43) 
nine PQ Ng? RS 


>>» Application: one-sheeted hyperboloid (Fig. 5.37) 

A straight line g rotates about an axis h and sweeps a one-sheeted hyper- 
boloid. Calculate the center M of the hyperboloid and the radius r of the 
smallest circle (waistline). 


Fig. 5.37 two families of straight lines 


Solution: 

M € his the pedal point of the common normal n of g, and h, r is the shortest 
distance between the two lines. 

The reflection of g in the plane determined by hf and n results in a straight 
line g which sweeps the same hyperboloid if rotated about h. Thus, there are 
two families of straight lines on the surface. This allows a sturdy construction 
of a giant walled structure in the form of a hyperboloid. 


® Remark: The cooling towers of nuclear power plants (Fig. 5.38) are examples of 
this. The nozzle form of the towers causes an acceleration of the water vapour in 
the upper third section of the tower. ® << 
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Fig. 5.38 practical application 


>>» Application: hyperbolic paraboloid (HP-surface) (Fig. 5.39) 

If one draws the lines meeting one axis a at a right angle from all points on a 
line g, then these lines sweep an HP-surface ®. The line a and the common 
normal n of a and g are the generators at the vertex S of ®. 


Fig. 5.39 An HP-surface as the set of lines meeting two lines (one is met at a 
normal). Right: There are two families of straight lines on the HP-surface! 


Conversely, one can take any arbitrary line meeting these two — in Fig. 5.39, 
for example, the boundary h — and then collect the horizontal lines meeting n 
in order to define the surface. Thus, the HP-surface also carries two different 
families of straight lines. Except for the one-sheeted hyperboloid (Application 
p. 202) and the hyperbolic paraboloid, there is no other surface with this 
property! According to the Theorem of Right Angles (p. 188), the two families 
of lines form a rectangular grid when projected orthogonally onto the plane 
spanned by a and n. Therefore, HP-surfaces are sometimes used for covering 
rectangular floor plans. <<< 


Area of a triangle (parallelogram) 


For the calculation of areas (area of a triangle), the cross product can be 
used: Let P, Q, and R be the vertices of a triangle. Then, the area A of the 
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triangle is computed via 


A=4|PQx PRI. (5.44) 


The area of the parallelogram spanned by PQ and PR is twice the area of 
the triangle PQR. 


Proof: We give just a brief sketch: For the area of a triangle (“height times base 
divided by 2”), we have 


A=3(a| || sin y. 
Now, we have to show that 
a ‘ink SS 
= 3|PQx PR, 
holds, and so, 
> — os 
|a| |b] sin yy =|PQ x PRI. 


Squaring both sides of the equation, (Formula (5.36)) sin? y = 1— cos? y and 


— ab 


cos p = dy +bo = i 


lal 


allows us to obtain a true statement with some laborious calculations. © 


>>> Application: the “Spatial Pythagoras” 

In the plane, the Pythagorean theorem holds. If you intersect a right angle 
(vertex C’) with a straight line, then the chord AB = c associated to the legs 
AC =a and BC = b can be calculated via c? = a? + b?. 

This theorem admits a generalization (in fact many) to a spatial version: 
If you intersect a rectangular tripod (vertex D of a cuboid) with a plane, 
then the area d of the triangular cross-section ABC can be expressed in 
terms of the areas a, b, and c of the triangles DAB, DBC, and DAC as 
=e +h ec". 

Proof: Let D be the origin of a Cartesian coordinate system with the points 
A(u/0/0), B(O/v/0), and C(0/0/w). Then, the triangles ASB, BSC, and CSA 


have areas 


2° 2 20° 
The area d of the triangle ABC is calculated using the cross product 


yas ce yO -u 1 (few 1 
d=5-|ABx AC| = =. lv [x] 0 [f= =-|[ wal] = =. (ow)? + (wu)? + (uv)?. 
2 2 0 2 2 
w wu 
Therefore, we have d? = : [ (vw)? + (wu)? + (wv)? ] =? +e? +a?. © <<< 
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Fig. 5.40 Spatial Pythagoras Fig. 5.41 a pitched roof 


>>> Application: Determine the area of the triangular face of the given 
pitched roof. (Fig. 5.41) 


Solution: 


PQ=|6|, PR=| 3 
5 


0 | 
0 9 30 
A=5 6]x| 3 =; 0 = VO 16.2. 
0 5 12 = 


Volume of a tetrahedron 


If S is a further point in space which together with P, Q, and R spans a 
tetrahedron, then we obtain the volume of the tetrahedron with 


V= “PB. (PQ x PR). (5.45) 


The volume of the associated oblique parallelepiped (“spar”) is six times as 
large. 


>>» Application: volume of convex bodies (Fig. 5.42) 

Determine the volume of an arbitrary convex body. 

Solution: 

If the body is not a polyhedron, one must first triangulate it (and thus, 
approximate it with arbritrary precision by means of a polyhedron). Then, 
look for a point S' inside the body (for example, the center of gravity as the 
arithmetic mean of all vertices) and “connect” each triangle PQR of the body 
with S in order to form tetrahedra. The total volume is the sum of all the 
tetrahedral volumes. 

In special cases — such as the pentagon dodecahedron in Fig. 5.43 — we use 
regular pyramids instead of arbitrary tetrahedra. 
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Fig. 5.42 decomposition of a body Fig. 5.43 special case 


® Remark: The body can also be “harmlessly non-convex”. The important thing is 
to a find a point S' so that the tetrahedral facets do not overlap. ® <<< 


Calculating torques by means of the cross product 


Fig. 5.44 torque relative to an axis ... Fig. 5.45... in practice 


>>» Application: torque relative to an axis (Fig. 5.44) 

By means of the cross product, the torque can be calculated. The formula 
Torque = Force x Lever 

is valid according to the ordinary law of the lever. If the force is not perpen- 

dicular to the lever arm, then the torque is calculated with the notation of 


Fig. 5.44 so that the lever equals 


M= F-d = |F\|7|siny = JF x zl. 
—d 
rectangle 
parallelogram 


of equal area 


Note that the order of the vectors in the cross product matters. Reversing 
the order changes the sign. d<< 
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5.5 Reflection 


Reflection in a plane 

Let P(pz,Py,pz) be a point in space and n9-% = c be the equation of the 
specular plane o (7) normalized). Then, according to equation (5.38), d = 
no+p—c is the oriented distance Po. For the reflection P* of P in o, one 
obtains the vector equation 


p = p-—2d7o = p— 2(No-P-c)no. (5.46) 
In R?, one can use Formula (5.46) for the reflection in the line 79 +Z = c. 


>>> Application: highlight point in a plane (Fig. 5.46) 
Which point R (the “highlight”) of a specular tetrahedron o shows the posi- 
tion of the light source from the position of the observer at P? 


Fig. 5.46 reflected point in theory ... Fig. 5.47... and in practice 


Solution: 

One reflects P in o, obtains P*, and intersects the line g = LP* with o, 
which results in R. 

® Remark: In glass panes, the reflection of the Sun is usually quite clear to see 
(Fig. 5.47, left). With reflections in the water, the water surface must be perfectly 
smooth. Otherwise, the reflection is elongated (Fig. 5.47). ® <<< 


>>> Application: strained rope (Fig. 5.48) 
A rope of given length s = 15 is strained between two contact points A and 
B with a weight. Calculate the position of the buckle point C. 


208 Chapter 5: Vector analysis 


Fig. 5.48 strained rope Fig. 5.49 reflection of the ray of light 


Solution: 

C lies on AB* where B* is obtained as the reflection of B, and thus, ¢ = 
a+ A(b* - @). With A(0,10), B(12,6), and AB* = s, the height difference 
equals AB* = V's? — 12? = 9 and, consequently, the height of C’ is &+ = 3.5. 


2 
Then, 
12A x 6.5 
(.2,)=(s) =9= 82 = creer. 


ddd 


If we reflect “only” one vector v in the plane o, we obtain the same result 
by reflecting the plane 7o-% = 0 that is obtained from o by translating it 
parallel into the origin of the coordinate system. It is, therefore, necessary to 
set c= 0 in equation (5.46): 


p* = 8 — 2(r%y-B) «Mp. (5.47) 


>>> Application: reflection of a ray of light (Fig. 5.49) 

At a point R(3/0/0), a mirror is mounted with a prescribed normal direc- 
tion 7(0/1/1). In which direction will the incident solar rays s(1/1/- 2) be 
reflected? 


Solution: 


>>> Application: stereo image with one mirror (Fig. 5.50) 
If one looks suitably at two photos of a scene (one mirrored) in such a way 
that one focuses with both eyes on one photo, say, the right one (with a 
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vertical mirror placed between the eyes), then the scene appears stereoscopic. 
Explain this “space image” effect. 


Fig. 5.50 a stereo image without and with a mirror 


Solution: 

If we view a planar or spatial object ® in a mirror, we see a virtual object ®* 
which is symmetric with respect to the mirror through the “mirror window”. 
So, if we look at the left image in the mirror with the left eye, we no longer see 
a mirror-reversed image. The right eye sees the right — in any case reflected — 
image. Under certain conditions, the associated lines of sight from the right 
eye to the right image and from the left eye to the left image intersect in 
space: The two photographs of the object must have been created so that the 
straight line connecting the two camera positions (specifically with respect 
to the lens centers) were parallel to the photo-plane. This allows us to place 
the photos in parallel so that all of its points P and P% lie on lines parallel 
to the bottom margin of the image. 


® Remark: If you look at stereo images without a mirror, and these, for example, 
should appear separated on the screen, then the spatial impression can be achieved 
by squinting or using optical glasses, which forces the lines of sight to “cross”. ® 

ddd 


Multiple reflections 


>>> Application: specular corners (Fig. 5.51, Fig. 5.53) 

If you look into a mirror, you see yourself “reversed”. In a two-way mirror 
with an acute angle a (Fig. 5.51: a = 60°), you can see how others see you. 
What happens if a = 90° or a> 90°. 

Solution: 

If a = 90°, then the doubly-reflected ray is “exactly parallel to the incident 
ray” (Fig. 5.52). This has the consequence that one eye sees the other, and, 
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e ~~ P=Q 
Fig. 5.51 double reflection Fig. 5.52 a = 90°, billiards 


vice versa. The same applies to the two halves of the face. The distance from 
the mirror does not matter. When looking at the edge of a rectangular double 
mirror, one sees — separated by the edge — both sides of the face, and they 
do not appear mirror-inverted. 

At obtuse angles a > 90° and with increasing distance, you will not see 
yourself when looking into the mirror’s corner. 


® Remark: The above example with a = 90° and P + Q can be interpreted in terms 
of billiards (rail shot). ® <<< 


Fig. 5.53 Confusion results from using three mirror planes, each constituting ap- 
proximately 60°. Since the angles are not accurate, some artefacts come into being. 


>>> Application: reflecting “cuboid corner” (Fig. 5.55) 

The special case a = 90° has an important practical application: Consider a 
reflecting “cuboid corner”, which consists of three concurrent faces of a cube 
(Fig. 5.55). Prove that if one shoots a laser ray into this corner, the ray is 
redirected parallel to the incoming ray. 

Proof: We look at the situation from above to get a top view: There, the reflections 
in the vertical planes appear as planar reflections in the respective normals. However, 
the reflection in the horizontal plane is not seen in the top view because the normal 
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Fig. 5.54 cuboid corner in Fig. 5.55 reflecting cuboid corner 
shipping 


“projects” to the base plane. Thus, following the above (Fig. 5.52) considerations in 
two-dimensions, the incoming ray appears in the top view parallel to the outgoing 
ray. The same holds for the views from the front and right (front view, right-side 
view). So, the outgoing ray is parallel to the incoming ray in space. 


Another proof is derived from the fact that, with each reflection in a coordinate 
plane, the sign of the corresponding component of the direction vector switches. 
Since all coordinate planes are affected, the vector is just reoriented. © 


® Remark: This property of the specular cuboid corner allows us to measure distan- 
ces. Reflecting prisms are, for example, mounted on the Moon to carry out certain 
experiments (verification of light speed and distance measuring). Such devices are 
also mounted on bridges to allow ships to determine their distance from the bridge’s 
pillars. ® 


Fig. 5.56 Reflectors for bicycles consist of many small corners! 


® Remark: Fig. 5.56 shows how reflecting cuboid corners are applied in daily life: 
The reflectors which are mounted on bicycles — viewed under a microscope — consist 
of nothing but small cuboid corners. When they are illuminated by a car, they 
accurately reflect back to the car driver. Along with the rotation, the reflectors 
produce an extremely effective and bright warning. ® daa 


>>> Application: multiple reflections in two orthogonal mirrors 
With multiple reflections in two perpendicular planes, a remarkable effect 


212 Chapter 5: Vector analysis 


occurs, whereby left and right — and not, as in the ordinary mirror, the front 
and the rear — are interchanged. What exactly is happening here? 


Solution: 

For the sake of simplicity (Fig. 5.57), the two planes € and 7 are assumed 
to be vertical coordinate planes with the equations x = 0 and y = 0. Each of 
the planes initially generates a virtual — usually reversed — counterpart 2, 
(change of signs in the x-values) or Q, (change of sign in the y-values). The 
two objects are indirectly congruent and — from the standpoint of perception 
— are of equal rank as the original 2. 


Consequently, 2, has a virtual coun- 
terpart Qz, through reflection in 7, 
and Q, has a virtual counterpart Qy, 
through reflection in €. However, be- 
cause of the special position of the 
mirror planes (€ 1 7), the two new 
§ objects are identical: Q* = Qzgy = Qyz- 
© Furthermore, they can only be part- 
} ly seen in the respective “mirror win- 
dow”. The part labeled with Qyz is 
produced through reflection in the 
right mirror window of the visible 
virtual object Q, in the left mirror 
window. 


a>. 


Fig. 5.57 Several mirrors in two mutually 
perpendicular planes of symmetry 


The two mirror windows are touching each other along the edge s = €U7 
(according to our choice, the z-axis), so they merge into a single window, 
which consists of two mutually perpendicular rectangles. This allows us to 
view our object *. It was created by double-reflection and is, therefore, 
directly congruent to Q. The two directly congruent objects Q and Q* can 
be transformed into each other by a half-turn about the intersection s of 
the mirror planes or by an axial reflection in s. Thereby, left and right are 
interchanged firstly, and the characters are still readable as usual. 

In the photo in Fig. 5.57, a wide-angle lens was used in order to make the 
“auxiliary objects” Q, and Q, entirely visible. 

In practice (also in human vision), the visual angle is smaller, and then, one 
only sees 2* in the image — comparable to an ordinary mirror image, but left 
and right are interchanged. <<< 


>>> Application: What exactly happens between two parallel mirrors? 

Consider the confusion with two parallel mirrors in Fig. 5.58: The photogra- 
pher was obviously able to focus through different distance settings on any 
of the available originals or reflections. Visually, one is unable to distinguish 
between original and virtual objects, so that it is not even guaranteed that 
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the person depicted is actually seen as such — this depends on the focal length 
that is actually used. That is to say, it depends essentially on the aperture 
of the viewing cone. 


x . ; 
LS ss : “a 5 at “a 


Fig. 5.58 Three similar scenes, but in each another reflection is in focus. Can you 
see where “the original” is? 


Solution: 

Let A be the position of the observer (Fig. 5.59) standing between two mirrors 
o1 and 02 (distance d apart). The camera, which is directed towards the axis 
a, can only capture those things that are within the cone of vision A. 


Now, assume that another person is standing (sym- 
bolized by a point P5) between the mirrors, at a 
distance u from a; and at a distance v from aa: 
u+v =d. Thus, two virtual mirror images P; and 
P» arise at the distances u and v behind oj and be- 
hind og. The two points are a distance 2u + 2v = 2d 
apart. P is visually indistinguishable from a real 
point and is reflected in o; to a point P3. This is at 


U : . é 
' distance 2d from Pp. If one continues, one obtains 
. = two congruent series of points Po, P3,... and Pi, 
Fig. 5.59 For each virtual P. A ink iad b fant 
Sai Aaiew oieannesa ene on eh ose points are separated by a constan 
“speed of light”. distance of 2d. 


In Fig. 5.58, these series are clearly visible (in one of them, the person can 
be seen from the front, in another from behind). In both Fig. 5.58 and Fig. 
5.59, the observer and the multiple reflections are not contained in the cone 
of vision. It is noteworthy that the autofocus of the camera has no problem 
focusing on any reflection, which again demonstrates that a purely optical 
mirror image is indistinguishable from reality. The distance to which the 
autofocus adjusts equals exactly that of the “reflected” person in the room — 
and not the distance to the mirror. The repeated reflection causes a loss of 
brightness, so that further apart reflections appear weaker (darker). <<< 
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>>> Application: mirror eyes (Fig. 5.60) 

Nearly all animals with lens eyes bundle light on their retina by means of 
refracting lenses. Some crayfish and shrimp, however, manage this trick by 
using compound eyes and reflection. 


HH 


Fig. 5.60 mirror eyes 


Fig. 5.61 “normal” and “rectangular” pseudopupils 


Solution: 

The underlying principle is purely geometrical: What happens to a light ray 
if it enters a hollow quadratic prism with reflecting faces? The light ray either 
passes through without a reflection, or bounces around between faces until 
it exits the prism. An interesting case occurs when the light bounces around 
between two subsequent — and thus perpendicular — faces: As one can see in 
the top view (Fig. 5.60, right), the incoming ray is parallel to the outgoing 
one in this projection. That is, the first two components of the directional 
vector are swapped. However, the third component (the z-value) is not af- 
fected. Now, we arrange thousands of tiny prisms on a sphere (Fig. 5.60, 
left), and investigate what happens to light rays that arrive from a certain 
radial direction. We consider all prisms whose axes are at a constant angle in 
relation to the incoming light. Their axes are distributed on a cone of revo- 
lution through the sphere’s center. Then, we consider the intersection circle 
of this cone with the sphere. If the prism’s diameter is very small, one might 
estimate that the ray exits the prism if it has been reflected in a single plane 
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through the axis of the prism perpendicular to the projection of the incoming 
ray in the direction of the axis. All light rays that hit the sphere within this 
circle and that are also reflected in an edge of the tiny corresponding prism 
enclose the same angle with the prism’s axis (a generating line of the cone). 
Thus, the reflected rays form another cone with its vertex on the radial ray 
(at approximately half the radius of the sphere). This vertex is a point on the 
retina, which is itself a concentric sphere. The locus of all quadratic facets 
(compounds) that allow at least a certain percentage of light rays to focus 
on the retina is a “spherical rectangle”. Therefore, the “pseudopupil” appears 
to be rectangular (Fig. 5.61, right). <<< 


>>> Application: the viewfinder of an SLR 
In each SLR, one can find an ingeniously simple invention: Through the 
viewfinder, one sees exactly what appears on the sensor when the mirror 
is folded up in the moment of admission, namely upright and not inverted. 
Explain the so-called “pentaprism” (Fig. 5.62). 

ray of light ray of light 


right half-image < left half-image 


s She j a 
mirror ala { | 


> \« image from the mirtor_—— 


sensor 


Fig. 5.62 This shows the operation of the pentaprism. The image that is incident 
to the mirror is “processed” in two different ways. The result is a composite of two 
fields of an upright image, as if you looked right through the lens. 


Solution: 

Through the lens, the light rays reach the sensor. The image is rotated by 
180°, so it seems to be upside-down behind the lens. Standing upright, the 
observer would like to see the image through the viewfinder. Between the 
lens and the sensor, there is a tilted mirror inclined at an angle less than 
45°. It is folded up only at the actual moment of taking a picture. A parallel 
lying mirror would draw the image back to the viewfinder, but in its rotated 
position. This rotation is instead corrected by means of the pentaprism inside 
the camera. The folding mirror deflects the incident light rays upwards in this 
prism. There, a double-reflection takes place. After a total of three reflections, 
the image appears mirror-inverted in the viewfinder. In order to turn it up- 
right in the “last minute”, the pentaprism is inserted (two orthogonal planes 
which deliver the additional two reflections). <<< 
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5.6 Further Applications 


>>> Application: downpour (Fig. 5.63) 

During a downpour, every one of us has asked him- or herself: How fast do 
I have to run to the nearest shelter to remain as dry as possible? Must it be 
“as fast as possible”? 


Fig. 5.63 Run away or endure the downpour? 


Solution: 

First, we move a sphere (radius 7) horizontally by a distance s (velocity 
vector 0], Fig. 5.63, left). The raindrops fall vertically (velocity vector v2). 
If we assume that both motions have constant speed, the sphere can only be 
met by droplets that are located within the volume that is generated by the 
sphere during the relative motion in the direction of ¥j —¥3. This is a cylinder 
of revolution with radius r and the axis direction Ui —~ v5. The projection of 
the axis length in the horizontal plane always equals s, whereby the volume 
of the cylinder (that is proportional to the number of “strikes”) is the smaller, 
the greater the speed of the sphere. So, the motto is: “As soon as possible!” 

Does the shape of the moving object play a role? We consider the simple case 
of a cuboid, and we move it again horizontally by a distance s (Fig. 5.63, 
right). This time, the volume where the various raindrop strikes occur is a 
six-sided oblique prism whose volume is not so easily specified. Let us now 
imagine the following special case: The top of the cuboid is protected against 
the rain. (The green drops shown in Fig. 5.63 (right) have no effect.) We 
move the cube so that only the front side (of area A) becomes wet. Then, the 
space of drops striking the prism is an oblique prism with constant volume. 
(If the drops fall vertically, then the volume equals A x s.) The velocity of 
the object does not matter! Therefore, the shape of the object is probably 
very important. In a downpour, it is recommended to hold a newspaper over 
one’s head when seeking the nearest shelter. When the rain — lashed by wind 
—is incident at an oblique angle, hold it against the wind. 
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® Remark: One thing seems clear: If we do not move, we get wet the most. Yet, not 
even this is self-evident: Mosquitoes and other small insects are rarely, if ever, hit 
by raindrops when flying through the air: Every drop has a “bow wave” in the air, 
and this pushes the insect aside in less than one millisecond! The middle of Fig. 
5.63 illustrates the impact of a raindrop. See also Application p. 484. @ daa 


>>» Application: Why does a shark not sink? (Fig. 5.64) 
The following question seems to be related to that of Application p. 178: 
Why does an airplane fly? Surprisingly, the answer is very similar. 


Fig. 5.64 An oceanic shark bears a striking resemblance to an aircraft. The pectoral 
fins work like the wings of an aircraft. 


Solution: 

Big sharks like the oceanic whitetip shark (Carcharhinus longimanus) have a 
density greater than water. So, if the shark stops swimming, it will sink. As 
it swings, the stiff pectoral fins, which are slanted upwards, generate lift like 
an aircraft wing. Since the pectoral fins are positioned before the animal’s 
center of mass, the generation of lift results in a torque that tilts the shark’s 
head upward. 

To repeat it again (this time adapted to the fluid water): The sharp edges 
of the pectoral fins create vortices that, according to the law of conservation 
of angular momentum, generate counter-rotating vortices around the profile 
(speed |Av|). The vector sum VU + Av, therefore, has greater magnitude at 
the top than at the bottom. 

Because of the hydrodynamic paradox, negative pressure is generated on the 
side where the water flows at greater speed, which will, in turn, produce a 
buoyant force that causes the shark to ascend. 

The shark’s asymmetrical dorsal fin, however, simultaneously produces a tor- 
que that tilts the head downwards. Both torques compensate each other, and 
thus, it is possible for the shark to swim straight ahead without sinking. <«<« 
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>>> Application: centroid of a quadrangle (Fig. 5.65) 

Consider the quadrangle ABC'D. Compute its centers of gravity with ho- 
mogeneous mass distribution (areal centroid), as well as with equal masses 
centered at the four vertices. 

Solution: 

We divide the quadrangle into two triangles ABC and ACD. For these, we 
calculate the centroids S; and Sz and the “masses” (= areas) m, and mz. 
The (areal) center of gravity S is computed with Formula (5.15): 


7 1 . . 
§ = ———(m)§] +252). 
Mm, +mg2 


The (ordinary) centroid is simply the “average” of the vertices 


t o 7 
Sp = 7(G+b+é+d). 


Of course, this point does generally not coincide with the areal centroid. <<< 


B 


Fig. 5.65 center of gravity of Fig. 5.66 positive or negative buoyancy 
a quadrangle 


>>» Application: positive or negative buoyancy (Fig. 5.66) 

A hull sinks into water, and depending on the weight of the ship, it will go 
down to a certain level. How far does it sink? Which forces occur, and when 
are they balanced? What will happen if the body topples over? 


Solution: 
We simplify the reasoning by only looking at a typical cross-section of a 
ship. The implementation of the following calculations, of course, is left to a 
computer. 


e One can now consider the weight force acting at the center of gravity (Fig. 
5.66, left). For the computation, we triangulate the cross-section. Then we 
use Formula (5.16) and the weight is proportional to the total area. 


e According to Archimedes, the buoyancy is equal to the weight of the dis- 
placed fluid. The force of gravity acts on the center of gravity of the 
displaced water (Fig. 5.66, right). We intersect the partial triangles of the 
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cross-section with the surface of the water. If a triangle is completely out- 
side the water, it does not count. If it is completely under the water, the 
area and centroid remain unchanged. For a triangle that intersects the 
surface of the water such that the part under the water is again a trian- 
gle, one can immediately calculate the area and the centroid. Otherwise, 
the part underwater is a quadrangle, which can be decomposed into two 
triangles. 


e When the buoyant force is greater than the weight force, the ship is lifted. 
If the centers of gravity are not lying precisely above each other, they 
induce a torque. The cross-sections of vessels are designed so that the 
ship is automatically lifted “upright”. Observe this in the demo program 
on the website that accompanies this book. See also Application p. 346 
and Application p. 347. “= 


>>> Application: Cardan (universal) joint (Fig. 5.67) 

We want to study the transmission of a rotation about an axis a to a rotation 
about an axis b intersecting a. A fork with shaft a includes a universal joint, 
which, in turn, drives a fork with axis b. The angle between the two axes is 
y (0°<7< 90°). 

“Input angle” a and “output angle” 6 are obviously different. 


Fig. 5.67 universal joint 


Let us first consider the construction of a general configuration: We assume 
that the axes a and 0 lie in a horizontal plane. From the front view and the 
side view, the following relation can be derived: 


tan 6 = tana. (5.48) 


cosy 
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The construction is not trivial and requires solid knowledge in the field of 
Descriptive Geometry. Now, we solve the problem by means of vector analy- 
sis: 

We associate a Cartesian coordinate system with the joint (ay-plane = ab- 
plane, a is the z-axis). The fork has a radius of 1. In the initial position, 
the end points A and B of the Cardan joint are located such that A has the 
coordinates Ag(0/1/0) and B has the coordinates Bo(0/0/1) (Fig. 5.67). 
Now, A is rotated about a (z-axis) by the given angle a which gives 


A(0/cosa/ sina). 


In order to determine the new location of B, we apply two superimposed 
rotations: 


1. The rotation about a through the unknown angle 6 moves Bp to the point 


B(0/sin B/ cos 8). 


2. The rotation about the z-axis through the angle —-7y assigns to B the final 
coordinates 


B(sinysin 8/ - cosy sin G/ cos B). 


The position vectors of A and B enclose a right angle. So, the following holds 


0 sin ysin 8 
cosa]-|-—cosysin 6 | = 0. 
sin a cos 3 


This leads to the condition 
—cosacosysin 3 + sina cos 2 = 0, (5.49) 


which is equivalent to (5.48). <<< 


>>> Application: Where does the Sun rise and set in the solstice? 

Stone Age monumental buildings such as Stonehenge (Fig. 5.68, left) and 
Chichén Itza (Fig. 5.68, center and right) are aligned over many hundreds of 
meters up to tenths of a degree precisely in the direction of the sunrise or 
sunset on June, 21st. 

At that time, the corresponding directions were determined empirically as the 
minimum deviation of the rising or setting sun from the northern direction. 
The directions were measured so accurately that one can draw the conclusion 
that, 4,500 years ago (Stonehenge), the inclination angle 6 of the Earth’s axis 
deviated by more than half a degree from the current angle of 6 = 23.44°. 
Investigate the corresponding directions for a given latitude by means of 
vector analysis. 

We use two simple “fit-for-everyday-use” and obvious lemmas. 


5.6 Further Applications 221 


Fig. 5.68 Stonehenge (left) and Chichén Itz (middle and right 


Proposition 1: You can find the North Star, and thus the direction of the 
Earth’s axis, by looking to the north at an angle of elevation y equal to the 
current latitude. At the Chephren Pyramid, one had to look towards the 
pyramid’s vertex from a point O which is south of the pyramid’s base at a 
distance equal to the pyramid’s height h in order to see the ancient North 
Star Thuban (Fig. 5.69). 


Proposition 2: In the course of a day, the Earth rotates once about its axis. 
That is, relatively to the Earth, the Sun revolves around the Earth’s axis. 
Especially at the solstices, the angle enclosed by the Sun’s rays and the 
Earth’s axis remains nearly constant. So, the light rays sweep out a cone of 
revolution. At the summer solstice, this angle is 0 = 90° — 0, i.e. currently 
~ 66.6° (4,500 years ago it was ~ 66.0°). 


z 


Fig. 5.69 Calculating the direction of the sunset (actually carried out at the Pyra- 
mid of Chephren at the summer solstice 4,500 years ago). 


Now, we define a Cartesian coordinate system with origin O and the horizon- 
tal zy-plane with the y-axis pointing in the northern direction (Fig. 5.69), 
so that the Earth’s axis lies in the yz-plane inclined under the angle y and 
has the aforementioned normalized direction vector @ = (0,cosy,siny) with 


222 Chapter 5: Vector analysis 


y as its latitude. 
Let w be the angle between the direction of the setting Sun and the North. 
The corresponding normalized direction vector then equals § = (sin w, cos w, 0). 
The angle o between the two directions satisfies cosa = @-§5, resulting in the 
simple condition 


cosa = cos ycos w. (5.50) 


Specifically, one obtains the geographic coordinates of Stonehenge (approxi- 
mately 4,500 years old) y = 51.2°, 0 = 66.0° or Chichén Itza (ca. 1,000 years 
old) y = 20.7°, o = 65.4°. The respective values for y are x 49.5° or » 63.6°. 
The result can be verified in the images provided by Google Earth (see Fig. 
5.68, center). << 


Fig. 5.70 A right triangle on the sphere shows a simple relation. 


>>> Application: the spherical Pythagoras 

Consider the unit sphere centered at the origin O in Fig. 5.69. Then the 
angles w, vy, and o correspond to lengths a, b, and c of the arcs on the 
great circles (Fig. 5.70) forming a spherical right triangle ABC. Moreover, 
equation (5.50) corresponds to the formula for the Pythagorean Theorem on 
the sphere: 


cosacosb = cose. 


Switching from spherical to Cartesian coordinates 

On a sphere (e.g. the Earth), points are often given in spherical coordinates 
A (the longitude) and y (the latitude). This is quite practical for beings 
living on a sphere, since we only need two numbers to know our location. 
For calculations with vectors, however, it is sometimes necessary to switch 
to Cartesian coordinates. 

Let a point be given by latitude and longitude: P(A, y). The radius of the 
sphere shall be r. According to Fig. 5.71, we immediately have 


x=rcosAcosy, y=rsinycoosy, z=rsiny. (5.51) 
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Fig. 5.71 switching from spherical to Cartesian coordinates: P(A, y) > P(2,y, z) 


>>> Application: shortest distance between two points on the Earth 


Solution: 

Let r = 1 for the time being and the two points be given by their spherical 
coordinates A(A4, ya) and B(Ag, yz). Then, the two points have — accor- 
ding to (5.51) — the Cartesian coordinates A(x4,ya,z4) and B(xg, yp, zp). 
Their coordinates can be interpreted as the components of unit vectors 
(length 1) @ and b. They enclose the angle 7 = arccosa-b. 

The shortest path AB from A to B is part of a great circle (Fig. 5.72, left). 
All great circles have the same circumference as the equator (40,000 km), 
and we have AB = 40,000: /360°. 

Numerical example: 

A... Anchorage (N61.2°, W149.9°) => a = (-0.417, -0.242, 0.876), 

B... Baltimore (N39.3°, W76.6°) => b = (0.179, —0.7528, 0.633), 

w = 48.535° > AB = 40, 000 - 48.535°/360° » 5,400 km. <<< 


>>> Application: initial course angle, when flying from A to B 


Solution: 

This application is a classical application of vector calculus. It is quite useful 
and must, of course, be implemented in any navigation system, especially on 
ships and aircrafts. 

Let again r = 1 and A(Ay4,ya) and B(Ag, yg) be two points on the unit 
sphere with corresponding unit vectors @ = (x4, ya, 2a) and b = (xB, yp, Zp). 
Again, the shortest path AB from A to B is part of a great circle (Fig. 
5.72, left). Its carrier plane y is defined by the three points A, B, and the 
center of the sphere (the origin of the coordinate system). Its normal vector 
is perpendicular to both a@ and b, and it is, thus, given by g = Gx b. We have 
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Fig. 5.72 Left: great circles as shortest paths, right: the course angle from A to B. 


to intersect y with the tangent plane in A with normal vector @ (Fig. 5.72, 
right). The direction vector ¢ of the intersection line is given by t = gx @. The 
north direction in A is given by 7 = @ x (-ya, 2 ‘Ay 0). The course angle 6 can 
finally be determined after normalizing 7 and ¢ via cos6 = fig «to. 

Numerical example: 

For A and B, we choose again Anchorage (N61.2°, W149.9°) and Baltimore 
(N39.3°, W76.6°) > @ = (0.417, -0.242, 0.876), 6 = (0.179, -0.753, 0.633). 
Then, we have g = (0.507, 0.421, 0.357), 719 = (0.758, 0.440, 0.482), and tp = 
(0.608, -0.791, 0.071). This yields cosé = 0.147 and 6 = 81.5°, which corre- 
sponds more or less to the direction EbN, but eventually leads southwards. 
That is to say, the airplane first goes slightly north, though the final desti- 
nation is 22° further south! <<< 


>>> Application: calculating the Sun's position in the generic case 

Making the following simplified assumptions, we calculate the incidence of 

sunlight in a given latitude y, time z and date: 

1. Time is taken to be “true solar time”. The Sun reaches its culmination at 
exactly 12” noon. 

2. The Earth’s orbit is approximated by a circle (the maximum deviation of 
the orbital ellipse is below 1.5% of the circle’s diameter). 

3. The Sun’s rays are assumed to be parallel because of the Sun’s distance 
from the Earth. 

4. Each month is said to have 30 days (year = 360 days). Then, the Earth 
rotates by 1° around the Sun per day. 

Calculate and discuss the direction vector to the Sun in: 

a) Vienna (y = 48°) on December 21st, 1620; 
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fe 
Fig. 5.73 where the Sun can reside (Equator or the Arctic Circle) 


b) Nairobi (vy = 0°) on May 20th, 12” or 18"; 
c) Rio de Janeiro (vy = 23° south) on June 21st, 12”, 
using the following formula: 


0 0 sin w 
S=cl|cosy]+cosw|-sing|]+] 0 |. (5.52) 
siny cos p 0 


Fig. 5.74 sunsets that are one month apart 


The angle of rotation from the south direction is (with z as the true solar 
time) w° = 15°(12 - z). The day-specific constant c is calculated from the 
rotation angle a in the “orbit circle” of the Earth: c = tan(23.44° sina). In 
the accompanying coordinate system, the east direction is the x-direction, the 
north direction coincides with the y-direction, and the z-axis always points to 
the zenith. In this system, the Pole Star lies in the direction (0, cosy, siny).! 


Solution: 
a) We have a = -90°, z = 16.33, and » = 48°. For this typical Central Euro- 
pean latitude and at winter solstice, one would expect that the Sun has 


already set. However, the calculation shows that the Sun sets relatively 


1More on this can be found in Geometry and Its Applications. 
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close to the WSW direction § = (—0.91, -0.57, 0). This is because the Sun 
has already started to descend before 12am Central European winter time 
due to its irregular orbit (but we compute with the true time). 

b) At the Equator, only 12-hour days are to be expected in the course of 
a year (Fig. 5.73, left). For a * 60°, the Sun at noon is in the direction 
(0, 0.37, 1), i.e. at an elevation angle of almost 70° in the North (and 
not, as one might suppose naively, at the zenith). At 6pm, it sets towards 
the WNW direction (-1, 0.37, 0). 

c) In the southern hemisphere, we do not see the North Star. However, at the 
elevation angle of the actual southern latitude, we see the Southern Cross. 
On June, 21st (a = 90°), the Sun reaches an elevation angle of slightly less 
than 45°: (0, 0.79, 0.75) and is located in the north. 


Fig. 5.75 sunrise in the temple of Abu Simbel 


® Remark: Fig. 5.73 shows that the Sun actually sets in the west at the solstice. 
The photomontage in Fig. 5.74 shows two sunsets one month apart. ® 


® Remark: The ancient Egyptians had a profound knowledge of the positions of the 
Sun. Fig. 5.75 shows a sunrise at the temple of Abu Simbel. Only for two days of 
the year, the leftmost nearly hidden statue of the God of Darkness is illuminated 
in the inner sanctum. ® << 


>>> Application: energy input by solar radiation 

Formula (5.52) allows us to calculate the normalized direction vector § poin- 
ting towards the Sun. According to Lambert’s law, the brightness, and thus 
also the energy input, is proportional to the cosine of the angle of incidence 
(Application p. 198). Using a computer program in order to sum the energy 
input values at certain latitudes and at short time intervals, say, every five 
minutes, every day, we can determine the corresponding theoretical energy 
input with relative precision. Interpret the diagram in Fig. 5.76. 

Solution: 

We assume that the program presumes clear blue skies at each day and at 
each latitude. 

Let us first consider the energy input at the Equator. As expected, it varies 
only slightly within a calendar year. 
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| North Pole 
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Fig. 5.76 theoretical solar energy input on the northern hemisphere 


At the North Pole, there is no energy input in the winter time. Yet, there is 
quite a lot during the summer months. In fact, during the period from late 
May to late July, the input exceeds that at the Equator! The explanation 
for this: The Sun is at the North Pole on June 21st at 23.44° above the 
horizon, and this will last for 24 hours a day. At this time, the Sun’s elevation 
angle at the Equator fluctuates between 0° (in the morning and evening) and 
66.56° = 90° — 23.44° (during lunch), and it only does so for 12 hours a day. 
In the summer, the temperate latitudes also have more energy input than 
the region at the Equator (but less than the North Pole!). In the winter time, 
the supply is significantly lower than in the tropical belt. <<< 


>>> Application: illumination of a telephone box (Fig. 5.59) 


A point-shaped light source L emits a light cone 

with vertex L (spotlight). This cone intersects 

an inclined plane ¢ along an ellipse e. The sur- 

face of the elliptical area is specular (comparable 

to an oblique section of a cylinder of revolution). 

\ The reflected rays now form a cone of revoluti- 
on (spotlight) through e, with vertex L* which 
is the reflection of LZ in ¢. Here, the center of 
the keyboard (dial) should be on the axis of the 
reflected spotlight. 


Pa P Y 


ddd 


>>> Application: solar power plant (Fig. 5.77) 

With the help of mirrors, light rays are bundled at a center (focal point). 
Let 5 = (4,0,3) be the direction of the Sun; § is to be redirected from the 
point P(6,8,0) to the focal point F'(0,0,24). What is the normal vector of 
the tiltable mirror plane? 

Solution: ay 

The new direction of the light ray is s* = PF = (-6,-8,24). We form both 
the two unit vectors so = 2(4,0, 3) and 55 = 75 (-3, —4,12). Their sum 8 + 89 
is the normal vector of the desired plane. <<< 
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R 


Fig. 5.77 solar panel Fig. 5.78 a sunny terrace in a northern 
location 


>>» Application: morning sun despite the north exposure (Fig. 5.78) 

At point H, there is a heliostat (a mirror rotated by clockwork toward direc- 
tion of the North Star). The light ray § is, thereby, reflected through a fixed 
point R on a tower. The ray HR points to the north in the top view and is 
inclined at y° (latitude) to the base plane. At the point R, the light ray can 
be reflected to an arbitrary point P. The (fixed) normal direction 7 of the 


mirror in R is the angle bisector of RP and RH. 


® Remark: The village Viganella in Piedmont used to be known as the “darkest 
village of Italy”, because there was no Sun for almost three months due to the steep 
mountains around. However, in 2006, a computer-controlled mirror with a surface 
of 40 m? size was mounted 400 meters above the village to redirect the Sun’s light 
exactly to the piazza during the winter months. ® daa 


>>> Application: rotation about a generic line (Fig. 5.79) 
Two aircraft wheels are to be moved by rotation from the operating position 
into the aircraft body. Calculate the rotation axis and the rotation angle. 


Fig. 5.79 rotation of aircraft wheels in theory and practice 
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Solution: 

Let M, be the center of the wheel in the starting position and A, a point on 
the wheel axle. In the final position, the wheel center is at Mp and the axis 
point has reached A». Then, the axis of rotation d is the intersection of the 
bisectors (planes of symmetry) o,y and o4 of M,Mg2 and Aj Ao. 

The circular orbit of the wheel’s center is located in the normal plane v 
(center M) to d. The rotation angle a is the angle MM Mg. 


In order to practice, we will compute a numerical example in detail: For exam- 
ple, suppose the centers are M;(3/0/0), M2(1/2/2) and the points on the axes are 
A, (4/0/0), Ao(1/2/1). 


1. To determine the plane of symmetry, we use Formula (5.27), according to which 


oy ee ete ees r 
aa: (a2 di) B= 3(@" a,”) => 


-3 
1 
-( ») 2-40 16)=-5>o0,4: —3x+2y+2=-5, 
1 


ae kh es bps Pr 
OM: (Mig M1) -% = 5 (Min ri”) => 


-2 
1 
>| 2 E= 5(9 9)=O0>o0m: -2u+2y+2z=0. 
2 


2. The axis of rotation is the intersection of a,j and oy, i.e. d = on, NOY. The 
position of the point P € d with z = 0: 


2a + 2y =0A-34+2y=-5 > @=5, y=5 => P(5/5/0). 


The direction vector 7 of d is normal to 72-7 and @2-—@1. Thus, we find 


-2 -3 -2 5 -2 
f= 2 |x 2 |=] -4 |, andd: “= 5 J+t] -4 
2 1 2 0 2 


3. The normal plane v 1 d, v3 M, (and automatically v > M2) has the equation 


= -2 3 -2 
feG=7.-M, =] —-4 0 |=-6> v: -4 Z=-6 
2 0 2 


4. Intersecting the normal plane v with the axis of rotation yields M =vnd: 


lores 
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——— as 
5. The angle of rotation a is measured between MM, and M Mo: 


3 OM eae: ape MM,-MM, 1 
MM, =| -1 |, MMo=| 1 |=>cosa= 2 == San 101.54". 
=) 0 J5- V5 ) << 


>>> Application: angles enclosed by the heights of a regular tetrahedron 
In nature, one hardly finds exact mathematical forms. More often, we find 
approximations. Among the few exact examples, there are various crystal 
lattices, such as that of diamond or germanium (perfect tetrahedral), as well 
as table salt (cubical). Designers also apply these highly symmetric forms 
in original ways. In order to produce the lamp shown in Fig. 5.80 (left), 
determine the angles enclosed by the altitudes of a regular tetrahedron. 


Fig. 5.80 designer lamp based on the altitudes of a regular tetrahedron 


Solution: 
We start with the angles formed by the diagonals of a cube (Fig. 5.81, midd- 
le): 


Fig. 5.81 The three diagonals of an octahedron trivially form an orthogonal tripod 
(left). The four diagonals of a cube do not form right angles. 


Assume the cube is centered at M(0/0/0), and its vertices are given by the 
coordinates A(1/-1/-1) (position vector @), B(1/1/-1) (position vector b) 


, 
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etc. Then, we build, for example, the direction vectors MA =a and MB=b 
of the diagonals, and the angle y is determined by 


cos = =-. 
. 3 


All other angles are either equal or, due to symmetry, they complement each 
other to 180° (cosy = -1/3). 

For the tetrahedron (Fig. 5.81, right), computation is not as easy at first, 
because the coordinates of the vertices “in the standard position” are com- 
plicated to write down and the direction vectors of the altitudes cannot be 
read off so easily. 


A much simpler coordinate representati- 
on of the vertices P, Q, R, S' of the tetra- 
hedron is obtained by “carving” the te- 
trahedron out of a cube as shown in Fig. 
5.82. We have already assigned coordina- 
tes to the vertices of the cube. The cal- 
culation of the angle between the heights 
needs no additional work, and we get 
“Q cos y = -1/3, thus, y » 109.5°. 
Fig. 5.82 tetrahedron in the cube 


At this point, we shall try to determine the angle subtended by the two faces 
of a tetrahedron. For this purpose, we calculate the angle between the faces’ 
normals. The interior angles are, therefore, supplementary angles to y (and 
hence — precisely up to two decimal places — 180° — 109.47° = 50.53°). «<< 


>>> Application: dihedral angle on an icosahedron (Fig. 5.83) 

Among the five Platonic solids (tetrahedron, hexahedron = cube, octahedron, 
dodecahedron, icosahedron), the icosahedron has the largest number of faces 
(Greek: ikosi = 20). What is the angle between two faces? 


Solution: 

We can fairly easily sketch an icosahedron as follows: We draw a harmonic 
rectangle (sidelengths 1 and ® = (1+ /5)/2 * 1.618, Application p. 37) 
in the zy-plane and then redraw it two more times as shown on the left- 
hand side of Fig. 5.83. The twelve points of the polyhedron then have the 
coordinates (0,+1,+®), (+1,+®,0), and (+6,0,+1). The “convex hull” of 
these points is the desired icosahedron. Now, consider the midpoint A(0, 0, ®) 
of the uppermost edge, and B(0,®,1) as a neighboring vertex. The angle a 


of the vector AB to the negative z-axis is half the dihedral angle of the 
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Fig. 5.83 how to build an icosahedron 


icosahedron. We have 


0 0 
cos a = : ® |-| 0 ]= esl >arx 69.1° 


62+ (1-0)2 _ = 62+ (1-6)2 << 
1-©@ 1 


>>» Application: Artists and astronomers (Fig. 5.84) 

The wonderful Stone Age paintings of Lascaux are some 17,000 years old. 
The Hall of the Bulls is very famous. Besides the artwork, there are dots 
on the walls, and these dots were most likely painted there on purpose. In 
particular, the pattern of the Pleiades appears several times. This is the case 
in many other ancient findings: The Navajo Indians also painted the same 
patterns, and on the sky disc of Nebra, the Pleiades are quite dominant. For 
the Mayas as well, the Pleiades were of extreme importance. Could Fig. 5.84 
also be a star-map? 


Solution: 

Taurus spans the 30-60th degree of the zodiac. The Pleiades are presently 
in the same position with respect to the other dominant star of the zodiac, 
Aldebaran. Furthermore, to the left of Aldebaran, there is the prominent 
Orion belt — three stars that more or less point to the Pleiades. So far, 
so good. Yet, unfortunately, the artist painted four dots in the Orion belt, 
instead of three. Does this mean that the speculation was wrong? 

The first question is: Did the sky 17,000 years ago resemble the sky today? 
Most stars stay rather stable in their relative position over many thousands of 
years (Application p. 233), because they are so far away that any — even fast 
— motion is almost negligible, especially when they have similar distances. 
Some closer stars like Sirius, however, move comparatively fast. So, one has 
to check whether the mysterious fourth star might have been Sirius. Indeed, 
it turns out that Sirius was closer to the Orion belt at that time, but — 


5.6 Further Applications 233 


Fig. 5.84 The Hall of the Bulls in Lascaux: Is it also a map of the nocturnal sky? 
The Pleiades (encircled in green) are depicted on many ancient paintings all over 
the world. Photo: Michael A. Rappengliick, 2012. 


unfortunately — not as close as one might have hoped. In order to conclude 
this discussion: The supposed star may still have been a comet (we know too 
little about comets at that time). Neither should we underestimate the skills 
of the artists. They knew exactly what they were drawing. Picasso, who was 
one of the first to see the discoverd paintings, declared, “We have learned 
nothing since then”. <<< 


>>> Application: fixed stars or not? (Fig. 5.85) 

Stars like our Sun and all visible stars on the firmament move quite fast with 
respect to the center of the galaxy, for instance, or also with respect to our 
Sun. However, since they are so far away from us, this will make no difference 
to the celestial map during a human life span. Within tens of thousands of 
years, there are some noticeable changes. Star clusters that are comparably 
far away from us will barely change their mutual positions on the sky. Some 
stars, like Sirius — “only” dg = 8.6 light-years away — move differently, since 
they are much closer in comparison. It has a relatively high “proper motion”. 
Where was Sirius at the time when the artist in Lascaux was watching the 
sky (Application p. 232)? 
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Fig. 5.85 Sirius on its way over the nocturnal sky ... 


Solution: 
From tables found on the Internet, we get the following data for Sirius: 
e radial velocity R, = —5.5 km/s. 


Sirius decreases its radial distance to our Sun by 5.5 km per second. 
e Right ascension a = —546 mas/yr, 

i.e. longitude difference on the celestial sphere: 
e Declination 6 = —1, 223 mas/yr, 

i.e. latitude difference on the celestial sphere. 
The abbreviation mas stands for milliarcsecond. 
Thus, 1 mas = 1/1,000-1/3, 600°. To convert into radians, we have to multiply 
by 7/180: 1 mas » 4.85- 107%. Thus, we have (measured in radians) @ 
0.00000265 and 6 x 0.00000593. 
Let us take 1 light-year (1 Ly » 30- 10° s- 300,000 km/s » 9.5-10!? km, 
Application p. 13) as our length unit. Then, the distance of Sirius is dg = 8.6 
and the radial speed R, ~ 0.000018 Ly/s. Sirius’s current position shall be 
defined by the direction Sp = (1,0,0) (Cartesian coordinate system with our 
Sun as the center). Provided that Sirius moves on a straight line, its position 
in t years is given by the vector 


dg R, 
5(t) =| 0 |+t]ds-q@ 
0 dg-6 


In order to measure angles, we have to normalize s(t) (> so(t)). The angle 
y(t) between those two vectors is given by cos y(t) = 59+ 5o0(t). 

For t = -17,000 years (Lascaux), Sirius was A * 6° off the current position 
(in the direction towards Orion’s belt). This is not close enough to justify 
the claim that the artist might have seen Sirius as part of the belt (the 
distance is three times as large). Of course, one must now also consider the 


5.6 Further Applications 235 


relative motion of the belt stars. They are, however, so far away that the 
corresponding angles are much smaller. If the painter would have lived some 
40,000 years earlier, the hypothesis that Sirius is the fourth star in the belt 
would have been a good theory. <<< 


>>> Application: How many points, lines, and planes are in three-space? 
This question may seem strange at first. The answer is, of course, “infinitely 
many”. Let us now investigate whether there are different categories of “infi- 
nite”.2 By oo!, we denote the number of points on the real line. Then, we can 
also say: There are oo! real numbers. The position vector of a point in space 
is uniquely defined by three real components. For each component, we have 
oo! possibilities. Thus, there are, in our terminology, 00% 
e co® points in space. 

Now we want to determine the number of unit vectors in three-space. So, we 
impose the condition ee ee = 1lona vector Ai(nz, ny, nz), and it becomes 
a unit vector. Therefore, we can prescribe only two components nz and ny. 


The third component is generated (aside from the sign) as nz = \/1-—n2—n2. 


Thus, there are “only” oo? unit vectors in three-space. Interpreted as position 
vectors, these co? vectors describe the points on the “unit sphere”. 


vectors and also 


Now for the planes: They are uniquely determined by a unit normal vector n 
(co? possibilities) and a real constant c (oo! possibilities), because 71-% = c is 
the equation of a plane. We can, therefore, define a plane by specifying the 
three real numbers nz, n,, and c. Thus, there are 

e as many planes as there are points in three-space, namely 00”! 

Still, the lines are to be counted. Are there as many, more, or fewer lines 
than points in three-space? 

A straight line g is uniquely determined by a point P (co? possibilities) 
and a normalized direction vector ig (co? possibilities). That would be an 
inconceivable number of co lines. On the other hand, the point P can be 
chosen freely on the line g (oo! possibilities) — g remains unchanged. Thus, 
there are only 

e co’ straight lines in three-space! 

® Remark: This sounds less dangerous than it is. One must first rephrase: There are 
infinitely more straight lines in space than there are points or planes. This is the 
reason why it is so hard to imagine a rotation in space about a “generic” line. Recall 
the example of the aircraft wheels (Application p. 228) or the fact that very few 
people know that the position of the Sun rotates about the imaginary axis through 
the Polar Star during a day (Fig. 5.73). Furthermore, in Descriptive Geometry, two 


?On such counting questions, one could give a whole lecture, and one easily falls into fallacies. 
In the end dimensions of manifolds (for example, all lines in space) are to be determined. It 
is possible to find a one-to-one and invertible mapping that assigns to each point in the plane 
precisely one line in the plane. In order to be precise, instead of posing the question “how many 
points”, one should ask instead “how many parameters” determine the points — this number 
equals the dimension of the manifold. 
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side views are necessary in order “to cause a line to project”, i.e. to make it appear 

as a point. In contrast, with only a single side view, you can make each plane project 

in such a way that it appears as a straight line! 

The advantages of vector analysis become evident in the treatment of the straight 

lines in space: Both the auxiliary point on the line and the direction of the line are, 

indeed, described by vectors, namely by a position vector and a direction vector. ® 
ddd 


>>> Application: How many points and lines are in the plane? 

With the considerations of Application p. 235, we obtain: 

Points in the plane are described by position vectors with two components. 
So, there are 

e co” points in the plane. 

There are only oo! normalized unit vectors, namely the position vectors of 
the points of the “unit circle”. 

Lines in the plane are — like the planes in three-space — defined by an equation 
of the form 7i9-% = c. So, they are specified by n, and c. Thus, there are 

e as many lines as points in the plane! 

The lines in the plane somehow correspond to the planes in three-space. 


Fig. 5.86 Consider all straight lines determined by a fixed point P and an arbitrary 
point in a horizontal plane @ (P not in 3). There exist as many lines as there are 
points in this plane, because any point in 6 can be joined with P. 


® Remark: Fig. 5.86 illustrates that through each point in space we have oo? straight 


lines and that we already have oo® straight lines that meet a single straight line. © 
ddd 


6 Functions and their derivatives 


SON 
ib 


In very generalized terms, a real function y = f(x) can be understood as a 
rule for assigning a real number y to a real number x. Some functions play 
a central role in mathematics and their numerous applications, as well as in 
nature. Among others, these are the power functions of the form y = x”; the 
trigonometric functions y = sinz, y = tanz, etc.; the exponential functions 
y = a”; and also their inverses. 

Besides elementary functions, which are given with mathematical exactness, 
the so-called empirical functions also play an essential role in applied mathe- 
matics. They are mostly given by a number of measurement points (sampling 
points), and intermediate points are interpolated. 

When solving problems of a physical, technical, or geometric nature, it is 
often necessary to employ the “derivative function” y’ = f’(a) of the original 
function. The derivative is computed by means of simple rules of differential 
calculus, and it is indispensable in the solution of many problems. 

In the age of computing, digital techniques are not only used to display func- 
tions, but also to differentiate them. This often leads to numerical problems, 
which will be discussed in this chapter. In general, the use of computers 
makes great sense, and it even allows us to solve so-called transcendental 
(non-algebraic) equations with sufficient precision. 
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6.1 Real functions and their inverses 


In order to get acquainted with the problem, let us first discuss a diagram 
showing a series of measurements. 


>>> Application: amount of lactate in blood related to physical work 
The lactate value describes the relation between the concentration of lactic 
acid (lactate concentration) in blood and the intensity of physical work of a 
human body (Fig. 6.1). 


Lactate values in blood during physical 


work: During each motion, muscles pro- 


lactate [mmol/l] the lactat trati F 
| [| in the blood increases duce both energy and lactate. Persistent 


with increasing exertion 


intense physical work causes over acidi- 
lactate-profile fication of the muscles due to a lack of 
sufficient oxygen supply. At a lactate va- 


anaerobic threshold 


lue of up to 2 mmol per liter, the orga- 
transition 


aerobic threshold 


nism is working “aerobically”, receiving 
a large portion of the required energy 
through the oxygen-intensive burning of 
body fat — without depleting the valua- 
ble glycogen storage of the muscles and 


intensity 
Fig. 6.1 lactate values ... the liver too quickly (which can only be 
replenished over longer periods of time). 


In the “anaerobic range”, the body increasingly needs to make use of these energy 
storages. Once the glycogen is fully depleted, the body becomes completely slack. 
Many long-distance runners experience this phenomenon if they do not hold back 
their tempo during the initial phase of their run, which causes them to reach the 
anaerobic phase too soon. On the axis of abscissas (usually the x axis), we notate 
the intensity of the physical work. On the axis of ordinates (usually the y axis), we 
notate a sufficient quantity of measured lactate values and connect the data points. 

ddd 


After this “warm-up example”, let us take a moment to clarify several terms 
so that we may approach the next considerations more exactly. 

The real numbers R are the set of all “decimal numbers”. Calculators and 
computers, for instance, usually work with finite decimal numbers — in other 
words, with a mere subset of the rational numbers and with functions that 
act on them. 

Real numbers may be visualized on the real line, as the points on a straight 
line correspond exactly and unambiguously to real numbers. A segment cor- 
responds to an interval of real numbers. The set {2| a < x < b} is called a 
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closed interval [a,b], and the set {2| a < x < b} is called an open interval 


Ja, bf. 


Definition and graph of a function 


A real function y = f(x) defined on [a,b] describes a rule that assigns a single 
real number y to each real number z € [a,b]. 

The variable x is called the independent variable or the argument, and y is 
called the dependent value or the function value. The interval in which x is 
defined is called the domain D ({2} = [a,b]), and the interval onto which z is 
mapped is called the codomain B. ({y} = [c,d]). 


The function value can be the result of a mathematical calculation or simply 
a measurement value. 

Let us return once more to our “warm-up example”, and this time, let us be 
a little more precise: 


>>> Application: lactate concentration as a function graph (Fig. 6.1) 

In the diagram Fig. 6.1, x represents the intensity of physical work done 
by the body, which may be rather subjective, but which can, nevertheless, 
be approximated by measuring the current heart frequency. Naturally, this 
intensity is subject to constraints, depending on the age and physical fitness 
of the athlete in question. Each heart frequency can be associated (in a 
relatively constant correlation) with a lactate value in the bloodstream. This 
means that an athlete who knows his or her individual “lactate function” may, 
with considerable accuracy, infer the current lactate concentration from the 
current heart frequency. 


® Remark: Sports medicine makes use of this insight when constructing training 
schedules. We may immediately see: A rise in heart frequency correlates with a rise 
of lactate concentration in the blood. However, this relation is not linear. Instead, 
lactate concentration increases in an exponential manner. We will soon discover 
what this means. 

From repeated measurement, it follows that it is sufficient to focus on the ranges 
D = [40,230] (boundary values for pulse) and B = [0.3,10] (boundary values for 
lactate). Any statements that go beyond these ranges amount to speculations, about 
which one should be skeptical. © <<< 


In order to draw the graph of the function (Fig. 6.2), we inscribe the cor- 
responding value y = f(a) for each value x € D in the Cartesian coordinate 
system. In general, the resulting graph is a line, which does not, however, 
look as in the picture to the right, because, by definition, a function is only 
given if the relation x > y is unambiguous: Each y-parallel can intersect the 
function graph no more than once! 
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ordinate y 


abscissa 


Fig. 6.2 function graph (on the right: not a function!) 


Inverse function 

Let y = f(x) be defined in D = [a,b]. Then, for every x € D, exactly one 
y € B (Fig. 6.3, left) is assigned. Often, we need the concept of the inverse 
function: 


If y € B matches exactly one x € D, then y = f(x) is invertible. 


yh 


ot 
< 


Fig. 6.3 invertible and non-invertible function 


The immediate and obvious theorem holds: 


A function is invertible if it constantly increases or decreases, that is, if it is 


“monotonic”. 


>>> Application: conclusions from the lactate values 

Our lactate function is invertible: It increases monotonously, which means 
that, at a higher heart rate, the body produces more lactate. Since lactate 
does not readily disappear from the blood, one can infer the impact of the 
lactate levels after an extreme athletic performance. <<< 


Now, we want to show by means of simple examples how the inverse function 
can be calculated when a mathematical formula exists for the function or 
how it can be found graphically if it is only given empirically. 


® Remark: In mathematics, the inverse function of a function f is denoted by f~+ 
or also by f* where the first term should not be confused with e The notation for 
the inverse function is not uniform for typical calculators. For example, the names 
of the inverse of the tangent function are found on most calculators as 

INV TAN, TAN-1, ATAN, ARCTAN. © 


6.1 Real functions and their inverses 241 


>>> Application: linear functions and their inverses 
Determine the inverse function of the linear function f(x) = ka +d 
Solution: 
We “solve” for x: a 4 d 
z= — a a fly). 

Now, y is the independent variable and x depends on y. In mathematics, it 
is common that the independent variable is denoted by x and the dependent 
variable is denoted by y. Thus, the variables in x = f~'(y) are replaced by 
(x <> y), and we get y = f*(x) as the inverse function of y = f(x) 

1 d 


Y= 7L-—. daa 


k k 


The inverse of a linear function is, thus, again a linear function. This becomes 
immediately apparent since an exchange of the coordinates of a point results 
in a reflection of the point on the line y = 2, which is called the “first median” 
(Fig. 6.4). If we apply this reflection to all points P~! of « = f~!(y), we 
obtain the inverse function x = f~!(y). Since y = f(x) has the same graph as 
x= f-'(y), we get: 


The graphs of y = f(x) and y = f~!(x) are symmetric with respect to the first 
median. 


For empirically defined functions, one has to be satisfied with this reflection. 


ft 
7 


Fig. 6.4 variable exchange Fig. 6.5 three temperature scales 


>>» Application: conversion between Celsius and Fahrenheit (Fig. 6.5) 
Compute the function f : C — F and its inverse f-': F > C. Which 
temperature in C corresponds to 451°F?! What temperature in C and F 
has the same numerical value? 

Solution: 

In Application p. 24, we derived the formulas for the conversion of the two 
linear temperature scales. If we call x the temperature in C,, then y is the 
temperature in F’. 


1¥For film buffs: In the film adaptation of a novel by Ray Bradbury it is the temperature which 
leads to the ignition of paper. 
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With the considerations of Application p. 241, we ensure that the following 
holds for the function and its inverse function: 


hie: y= 0432, \ y= 2(2~32), (6.1) 


From this, we see that 451°F corresponds to 3 (451° — 32°)C x 233°C. An 
easy to remember pair of corresponding values is obtained by swapping digits: 
16°C correspond to 61°F. 
If C =F, then x equals y and thus 0.76 

9 


4 
w= —a+ 32 se 32 > 2 = -40 => -40°C = -40°F. 


We get the same result either by equating the inverse function 3 (a-32) with 
x or, in a more complex manner, by equating (= intersecting) the function 
and the inverse function in Formula (6.1). <<< 


>>> Application: parabola and inverse function (Fig. 6.6) 

‘In standard position”, the “unit parabola” has the equation y = x7. The 
graph is not monotonic in the entire domain —-co < x < oo. Therefore, the 
function is not invertible. If we restrict ourselves to positive values of x, the 
graph is monotonically increasing and the function is, therefore, invertible. 
Solution: 

The function y = x? with 0 < x < oo has the inverse x = +/y with 0< y< ow. 
We write the inverse function (interchange x and y): 


y=tJ/x with O0<a2<oo. 


® Remark: All parabolas are similar to each other (this is not true for the other 


conic sections, i.e. for ellipses and hyperbolas). The graph of the parabola 
y=aa" (0<4< 0) 


looks the same as the unit parabola when the unit length is multiplied by the factor 
Va. The inverse function is, then, 


y=f= with 0<a2<oo. 
a 


>>> Application: equilateral hyperbola and inverse function (Fig. 6.7) 
The asymptotes of an equilateral hyperbola are perpendicular. One can, the- 
refore, rotate them in such a way that the axes of the coordinate system 
coincide with their asymptotes. Then, the equation of the function is 


@ << 


a 
y=— (-wK<ar<o, r#0). 
af 


What is the inverse function? 
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y \ Ph) 
7 


Fig. 6.6 inverse parabola or ... Fig. 6.7... equilateral hyperbola 


Solution: 
Given the function y = ¢, we change the variables: x = [ and the inverse 
function reads y = 4 (x #0). Therefore, the function is self-inverse. 


® Remark: The vertices lie on the axis of symmetry: 2 = 4 > e=a>nez=y=stVJa. 
Among all hyperbolas, the equilateral hyperbolas play a special role as circles do 
among ellipses. The equilateral hyperbolas are — like parabolas and circles — all 
similar to each other. 


7) A viewing angle in degrees 


2600 
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Fig. 6.8 The relation between the viewing angle y and focal length 6 is indirectly 
proportional: The larger the focal length, the smaller the viewing angle. Based on 
information provided by the manufacturers, the given approximation formulas can 
be calculated. On the graph of the function, the manufacturers’ data are marked 
with small circles. 


Fig. 6.8 shows a practical application where the function graph is an equilateral hy- 
perbola: The relation between the optical angle y (measured diagonally in degrees) 
and focal length f of a lens is indirectly proportional. The inverse function is of the 
same type. ® << 
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6.2 Power, exponential, and logarithmic functions 


A function of the type y = 2” with a constant exponent and a variable base is 
called a power function. 


We have seen special power functions on many occasions. For n = 1, the 

function is linear (the graph is a straight line). For n = 2, the function is 

quadratic (the graph is a parabola). For n = 3, the function is “cubic” (the 

graph is a cubic parabola). The power functions can also be equipped with 
Th 


1 
negative exponents, and the rule x” = — comes into play. As a simple 
x 


1 
example, we have already considered the function y = — with an equilateral 
x 


hyperbola for its graph. 

Consequently, n is allowed to be any rational or real number. The following 
rule applies: x¢ = /z?. 

It is obvious that the power functions y = x" and y = °/x are each other’s 
inverses. Their graphs are, therefore, symmetric with respect to the first me- 
dian. 


>>» Application: calculation of points in the decathlon (Fig. 6.9) 
The calculation of points in the decathlon uses the formula 


y =alb- 2° 


to derive the number of points y (a, b, and c are constants where b is always 
the limit above and below which there are no more points). The value z is the 
output that is actually achieved. Discuss the function for a = 25.44, b = 18, 
and c = 1.81 (100m sprint), as well as for a = 51.39, 6 = 1.5, and c = 1.05 
(shot put). Which time in the 100m sprint achieves 500 points? 


\ pO sill 


1000 Sa ——_— 


10s S ~ 20s 


40m 15m 20m xX 


Fig. 6.9 points in a decathlon 


Solution: 
In the case of the 100m sprint, the formula reads y = 25.44(18-a)1". In the 
case of shot put, it reads y = 51.39(a - 1.5)1-°°. The function graphs can be 
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seen in Fig. 6.9. In shot put, the number of points increases almost linearly 
with the achieved distance. Extremely good performances in the 100 m sprint 
(the magical 10s) can thus be said to be “rewarded disproportionately”. 
In order to calculate x from a prescribed y, we use the inverse function: 


1 0.5525 
= 25.44(18 - 7)'8! = (2 _)™™ = 18- =i9-(—— 
a (18-2)! > (5) — (a) 
It needs x * 12.8 seconds in order to reach y = 500 points. <<< 


>>» Application: uniformly inflating a balloon (Fig. 6.10) 

The volume of a balloon will uniformly increase by inflating, wherein the 
balloon substantially retains its shape. How do the length scale or the surface 
area change? 


V(t) 
100 
S 
se) 32 
5 50 r(t) 
r(t) § 
‘3 25 S(t) 
ss V(t) 
0 
t 0 10 = |20 630640 t 
water depth [m] 
Fig. 6.10 inflating a balloon Fig. 6.11 balloon under water 


Solution: 

We know from the section on similarities that volumes change with the cube 
of the length scale r and surface areas change with the square of r. We will 
now apply the time ¢ on the z-axis (“abscissa”). By assumption, the volume 
increases linearly with respect to time. Then, the length of the scale varies 
with the cubic root of the time: r(t) = ¢/3. The surface S changes with 
the square of the length scale r: S(t) = (44/3)? = 17/3, Fig. 6.10 shows the 
corresponding diagrams. 

® Remark: The equation does not contain any of the gas laws (see next Application). 
The air pressure inside the balloon will rise slightly until it bursts, thus reducing 
the actual volume and the actual surface. When inflating a bicycle tube, the outer 
shell prevents the tube from expanding. This does not increase the volume, but, as 


desired, only the internal pressure. ® <<< 


>>» Application: submersion of a balloon (Fig. 6.11) 
A balloon is pushed under water. How do the volume, the scale of length, 
and the surface change? 
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Solution: 

Let Vo, ro, and So be the volume, the radius, and the surface area of the 
balloon, and let po = 1 bar be the initial pressure at the water’s surface. 
This time, the abscissa is the depth of the water t. At constant temperature, 
Boyle’s Law is valid: pV = po Vo = const. The external pressure p increases 
by 1 bar with every 10m of depth: 


t 
pa ie with r=1l+—. 
10 10 


This applies to the volume, radius, and surface area of the balloon: 


v= Be Vor? r=roa "3, S= Sor. 
Pp 


Fig. 6.11 illustrates quite clearly the dependencies; along the ordinate axis, 
the dependent variables are marked by the percentages of the various initial 
values. <<< 


In Application p. 245, the dependence of the volume V on the initial volu- 
me Vp was indirectly proportional to the “depth factor” t;. Now consider the 
following questions: 


>>» Application: decrease in brightness (Fig. 6.12) 

In a lake, the intensity of light decreases by p = 7% with every 1m of water 
depth. What is the intensity at a depth of x meters? At what depth is 50% 
of the original intensity left? 


Fig. 6.12 Left: Deep Blue. Right: Red is the camouflage under water! 


Solution: 

Let Lo be the light intensity at the surface. At a depth of 1m, we have only 
L(1) = 0.93 Lo. At a depth of 2m, we have L(2) = 0.93 (0.93 Lo) = 0.93? Lo. 
At a depth of 3m, we have L(3) = 0.93 (0.93 L(1)) = 0.93° Lg. So, at a depth 
of zm, we obviously have 
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For x = 9, we thus obtain a light intensity of L(9) = 0.520Lo. For x = 10, 
we obtain L(10) = 0.484 Lo. So, it is somewhere between a depth of 9m and 
10m that the light intensity is only half of the original intensity. 


® Remark: The various light components lose their intensity at different rates. For 
example, blues and greens prevail at greater depths. At a depth of 30 meters, blood 
appears greenish to the diver! The red parts disappear first. This is why red provides 
a good camouflage under water. It can only be seen clearly when it is photographed 
with a flash as in Fig. 6.12 on the right! © <<< 


Although the tasks in Application p. 245 and Application p. 246 appear to be 
very similar at the beginning, they lead to substantially different functions. 
Note that in the formula D(x) = 0.93” Lo, the base is 0.93, and thus, it is 
constant. The exponent is the variable . 


A function of the type y = a” with a constant base and an exponent as a 
variable is called an exponential function of base a. 


A typical example of an exponential function is the relation between the 
interest and capital: 


>>> Application: monthly interest 
The formula for the interest on an initial capital of Ko over x years with an 
annual capitalization (with an interest rate of p percent) is 


K=Kpa” with gota, 
100 


What is the refined formula for monthly interest if the capital at the end of 
the year should be the same? 


p=9% 


p=6% 

Ko p=3% 

15 10 5 0 5 10 15 years 
Fig. 6.13 different interest rates 


Solution: 
Let q be the monthly interest in percent. Then, after « months we have 


K = Ko (1+ 7§5)”. In fact, after 12 months, it must be true that 


12 
q Pp q 12 12 
Relies?) shel Hee We 100( 12/a-1). 
0 +5) a Z| aig. Ve) 
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This yields 

K=Kob'* with b= Wa. a 
Application p. 247 shows that a monthly interest return (or any other in- 
terest) leads to an exponential function. Further refinement allows us to 


compute interest rates at every second with 
K = Ko (/a)’* = Koa". 


Here, o » 3-10" is the number of seconds per year. Now, we have a function 
that gives us the value of the capital “at every time” x € R. Fig. 6.13 shows the 
associated exponential functions for p = 3% (y = 1.03”), p = 6% (y = 1.06”), 
and p = 9% (y = 1.09”). We choose the current time to be x = 0. This 
calculation of capital can also be used for negative periods of time. We can, 
for example, ask how much capital we could save up to a certain date in 
order to get paid the capital Ko at that moment. 

® Remark: The study of the exponential graph in Fig. 6.13 shows that, in the long 
run, a doubled or even tripled interest rate “generates” disproportionately more 
capital. This can be good for savings but can be potentially ruinous for borrowers. @ 


Without proof, the following quite non-trivial theorem applies (see Fig. 6.14 
or Application p. 286): 


y (scale 1: 400,000) 


— pd _ 9@ 
f(x)=2 2 y=2° y=2" 


Fig. 6.14 exponential vs. power function 


Each exponential function a” with even smaller base a > 1 becomes “obsolete” 
at some point compared to any other power function x” with a large exponent 


n. 


This means that it is not possible to catch up with the exponential function 
in the “long run”. 

>>> Application: return over a very long time 

Just as a theoretical juggling with numbers: Suppose you had one monetary 
unit (1 MU) given to the bank 2,000 years ago and the bank is obliged to 
reimburse at a lower but annual interest rate of p = 0.5% (1%, 1.5%) over 
the years. How many units of money would you have at the bank now? 
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Solution: 
p=0.5% => 1.0057°°.1 MU s 2-104 MU, 
p=1.0% => 1.0107°°.1MU x 4-108 MU, 
p=1.5% => 1.015%,°.1 MU ~9-10!2 MU. 


® Remark: Just to give an idea of these magnitudes: If we think of a monetary unit 
as a distance of one kilometer, then the last number corresponds to about 9-10!? km, 
i.e. a light-year. ® da 


An interesting example in which this is applied can be found in the Appendix 
chapter on Mathematics in Music: The length of the air column in wind 
instruments grows exponentially with the tonal depth (Application p. 517). 


Logarithms and exponential equations 


In practice, a common equation of the form a” = c is called an exponential 
equation. In order to solve it, we need the inverse of the exponential function. 
One has to “bring down” the exponent. We define: 


The inverse function of the exponential function y = a” is a logarithm to the 
base a: y = log, x. The logarithm of x to the base a is a number y that is 
obtained by raising a to the power wx. 


Now, we have 


a => y= lop, a. (6.2) 


In practice, one is satisfied with the “logarithm” log;, x (the base 10 is usually 
indicated), and the “natural logarithm” In x (sometimes called Logx). In 


d 


these cases, the base is 10 or Euler’s number 


ex 2.71828183, 


which is discussed in greater detail in the Appendix. Both logarithms are 
available on calculators, but, on PCs, only the natural logarithm is usual- 
ly implemented. In the section on power series, we will see how computers 
calculate the logarithm. (However, we do not need to worry about it.) 

For logarithms, the following important rules hold: 


log, %1 +log, v2 = log,(a1-22), 
Z1 
log, 71 —-log, 2 = log, —, 
x2 
n- log. = loga”, (6.3) 
4 -log,x = log, Vz. 


Proof: The proof is purely formalistic and uses the rules of calculations with powers. 
We find it sufficient to prove only the first statement. The remaining statements can 
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be proven in an analogous way: Since the logarithm and the exponential function 
“compensate” each other, we always have y = log, a’. So, we get 


log, v1 + log, x2 = log, [ate v1 + log, v2)] = 


= log, [alese 1. lB, #2) = log, (a1 - 22). © 


With logarithms, we can reduce the multiplication and division of numbers 
to the addition and subtraction of the respective logarithms. The power and 
root function can be reduced to multiplying or dividing numbers. This is 
still one of the most important applications of logarithms. A computer is 
constantly expected to do this internally. The invention of logarithmic tables 
was a major breakthrough in applied mathematics. (John Napier (1550- 
1617) invented calculation with logarithms and published the first logarithmic 
tables in 1614.) 

The relation between different logarithms is given by constant factors. For 
example, if the a-logarithm of a number z is known, then we also know every 
other logarithm, such as the b-logarithm. We have the formula 


log,z =k-log,x with k= = constant. (6.4) 


log, b 


Proof: Given the definition x = plogyx we can also deduce 
log, (x) = log, (p08), 


Following the rules of calculation for logarithms, this equation can be transformed 
into 
log, = log,x - log, b, 


which results in 


1 
] v= l - 
ome = slog, 2 © 


a 


(1/0) x 


Fig. 6.15 logarithms with different bases 


® Remark: The graphs of the various logarithmic functions differ by a scaling in 
the y-direction (Fig. 6.15). Programmers sometimes need to be familiar with For- 
mula (6.4), because the computer only “knows” the natural logarithm Inz. The 
adjusted formula is 

Ina ® 


ie eo 
a et 
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>>> Application: cooling a liquid 

A liquid with a temperature of 97° cools down to a room temperature of 22°. 
After ten minutes, it has a temperature of 76°. What is the temperature at 
n minutes? When does it reach body temperature 37°? 

Solution: 

The initial temperature difference is A = 75°. After ten minutes, it is only 
Aio = 54°. When A is reduced by a factor of k per minute, it has decreased 
by a factor of k° in ten minutes: Ajo = k!°A. Thus, 


[54 
54=75-k 9 >k= Y = = 0.96768. 


After n minutes, the temperature ty, is, thus, 
ty = 22° + 0.96768” - 75°. 
If t, = 37°, then we have 
37° = 22° + 0.96768" - 75° = 0.96768” = 0.20000. 
We take the natural logarithm on both sides of the equation and obtain 
In(0.96768") = In 0.20000 = n- 1n 0.96768 = In 0.20000 = n * 49. 


Therefore, the liquid has reached body temperature after about 49 minutes. 


® Remark: The time it takes for a liquid to equilibrate to the outdoor temperature 
depends on the insulating capacity of the container and also on the size of the area 
of the contact surface with the air: Deep lakes, therefore, heat up at a slower rate 
in early summer than shallower ones. @ da 


>>> Application: discounting 

At what cash value B can a debt of 60,000 MU, which is overdue in 5 years, 
be superseded today when an annual rate of 7% is applied? At what value 
must an annual repayment rate (due at the end of the year) be set? Use and 
prove the following formula, which is important in financial mathematics: 


1- 
l+qtq?+...¢q”!= a, (6.5) 


Solution: 


After one year, the present value of B is “worth” 1.07 B. After 5 years, it is, 
therefore, 1.07° B = 60,000 > B = 42,779 MU. 
Let R be the annual rate. Then, 


1.074 R+ 1.07? R+ 1.077 R+1.07' R+ 1.07° R = 60, 000. 
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We calculate the sum using Formula (6.5). The “proof” of the formula is the 
simple calculation (multiplication) of the equivalent formula 


(L+q+q?+--+q™")(1-q) =1-4". 
Therefore, we have 


11,077 
11 = 60,000 > R = 10, 433. << 
1- 1.07 


>>> Application: an unfair offer 

The following text is taken from an advertisement in a daily newspaper: 
“nenthouse, ..., € 456,000. Financing: single premium € 150,000, monthly 
€ 570 (Note: 30 years) ...”. 

What is striking here? What happens if the interest rate is 5%? 


Solution: 

After a rough calculation, any buyer will opt for the offered financing. The 
buyer may already have €456,000 in cash, but paying in instalments is still 
much cheaper. Even computing without interest and compound interest will 
yield a much smaller amount 


150, 000 + 30- 12-570 = 355, 200. 


Which bank would give a negative interest rate? 

The seller’s declaration: The whole thing is financed by means of cheap yen 
loans. If the yen loans were no longer so low, one could, indeed, “roll over” 
by swapping these yen debts into debts in other currencies. No problem!? 
Suppose you have to roll over and, therefore, take a “local” loan with an 
annual interest of p= 5%, which is still a bargain. We calculate the monthly 
rate M as follows: After the single premium is paid, we still have 456, 000 — 
150,000 = 306,000 € due. According to Application p. 247, we have b = 
\/1+ p/100 = 1.00407. As in Application p. 251, we relate all amounts to 
the same time, for instance, after about 30 years or 12-30 (= 360) months. 
We use Formula (6.5): 


. 1 — 9260 
306, 000 b°°° = Mf (b°°? + B78 +... 4+b? +541) =M = => M =1,615€. 


In such a case, which is not unlikely to happen, the monthly payment is, 
therefore, nearly three times as high as what was given to the buyer at the 
time of buying — and this over a period of 30 years! <<a 


>>> Application: exponential growth 
A certain bacterial culture multiplies by 12% per minute. After how long will 
it have increased tenfold or thousandfold? 
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Solution: ido 
10=1.12 > Inl0=2In1.12 = ——— # 20 and 
meLoog mh? 
1,000 = 1.127 > 2=—— — w60. 
In 1,12 


After 20 minutes, a tenfold increase will have occurred. After one hour, the 
culture will have already increased a thousandfold! The immune system of 
the body can, by the way, “neutralize” an infection with up to 150 million 
germs. 


® Remark: This particular calculation would work better with a logarithm to the 
base 10, because log 10 = 1 and log1,000 = 3. However, this function is not always 
available on a PC. ® <<< 


>>> Application: the '4C-method to determine the age 

After the death of an animal or plant, the radioactive carbon isotope C 
disintegrates in its body with a half-life period of 5,730 + 40 years. What is 
the age of a mummy that contains 40% of the initial amount of 4C? 


Solution: 

Let Go be the amount of C of a living human, and let G be the amount 
of *C in the mummy in question. Let now x be the age of the mummy, 
measured in units of 5,730 years. Then, 


log 0.4 


G =0.5"Gp = 0.4=0.5" 
log 0.5 


= 1.3219. 


The age of the mummy is, thus, 1.3219-(5, 730440) = 7,575453 years. <<< 


>>> Application: decrease of air pressure 

Atmospheric pressure b decreases by half of the previous value after ho = 
5.5km height. What is b at a height of hkm if bo9(* 1bar) is measured at 
sea level? What is the air pressure at Mount Everest (almost 9km above sea 
level)? What is the height at 0.8 bo? 

Solution: 


h 
We measure the height h again as a multiple x of the “unit” ho: x = i The 
0 
corresponding air pressure b is then 


b = 0.5" bo. 


1 
At a height of approximately 9km (= x * 1.6), the pressure is b » 300: 


In0. 
From 0.5” bo = 0.8 bo, we infer x = — : 


n0. 
the relative air pressure is only 80%. 


x 0.32 > h=1,770m. At this height, 


® Remark: The altitude of 9km is considered the absolute top barrier in which a 
human can survive (only for hours) without artificial oxygen. Therefore, it is always 
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pointed out in airplanes that, when there is a drop in cabin pressure, oxygen masks 
must be used. 

When diving in mountain lakes at high altitudes, one has to take into account that 
the decompression is much more powerful than at sea level. At 40m depth, an 
external pressure of 0.8bar prevails, and there is only a total pressure of 4 bar + 
0.8bar = 4.8bar. The ratio when surfacing is oa = 6, which occurs only when 
diving from sea level to a depth of 50m. While the short-term plunge at 40m is 
still classified as harmless, you need a “decompression stop” when diving to 50m 
(usually several minutes akinetic and hovering at 5m) to prevent the “bends” (see 
Application p. 77). ® <a 


>>> Application: air pressure vs. water pressure 

Why does air pressure decrease exponentially, while water pressure decreases 
linearly? 

Solution: 

Liquids (water) cannot be compressed like gas (air). A “water column” of 
10m height always has the same weight (= linear growth), while the weight 
of a 10m “air column” increases proportionally to the pressure (weight) of 
the amount of gas lying above it. Thus, air becomes more compressed and is 
getting heavier (= exponential growth). <<< 


>>> Application: population growth (Fig. 6.16) 

Exponential growth in nature must eventually lead to a collapse. For example, 
an increase of population cannot be exponential in the long run. Interpret 
the graph in Fig. 6.16. 


p f\in Mio (Lima/Peru) 


20 
Megacity 


City t 


1900 1950 2000 


Fig. 6.16 demography of Lima and Lima-Megacity 


Solution: 

The growth of the city almost seems to be an exponential function. However, 
the growth of the megacity is “only” linear in the last decades, and from 1900 
onwards, it has even weakened. The “difference” between the two graphs — i.e. 
the number of persons who live in Greater Lima, but not in the city proper — 
shows that there is still a rising trend but one can recognize a “bend down”. 
See also Application p. 259. <<< 
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Logarithmic Graphs 


In textbooks of physics, chemistry, or biology, one can often find graphs 
showing exponential changes that appear linear. This is accomplished by the 
so-called logarithmic scale. Repeated multiplication of a number by the same 
factor will always cause the same addition in the logarithmic chart, and the 
problem becomes “linearized”. So, one can — at least graphically — obtain fast 
solutions to specific tasks. See Application p. 255, 6.24. 


>>> Application: exponential growth with logarithmic scale 

In 2001, Nigeria had approximately 125 million inhabitants and an annual 
growth rate of 3.5%, whereas Brazil had 160 million inhabitants and a growth 
rate of 2.1%. When will the population of Nigeria exceed the population of 
Brazil? Draw a logarithmic graph. 


300 Es mio. 
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Fig. 6.17 logarithmic population chart 


Solution: 
Mathematically, this problem is easy to solve: 


125 - 10° - 1.035” = 160- 10° - 1.0217 > 1.014” = 1.28 


log 1.28 
{= —— 8 
log 1.014 


Thus, after about 18 years, Nigeria’s population will exceed that of Brazil. 
In the logarithmic graph (Fig. 6.17), we do not plot the number y of indivi- 
duals but the logarithm log y of the number of individuals along the ordinate 
(with arbitrary base). Since the logarithm increases linearly, the instant of 
equal population appears as the intersection of two straight lines. 


® Remark: Population forecasts are always subject to inaccuracies (in this case, the 
data are partly estimated) and assumptions (constant growth rate). The future will 
show whether the figures coincide reasonably with reality. The small angle formed by 
the two lines in the diagram indicates that this result is not very accurate. Yet, there 
are predictions from the past that turned out to be right: In 1950, about 2.4 billion 
people were living on Earth, and the annual growth was 43 million, corresponding 
to a growth rate of 1.78%. Thereupon, one predicted almost 6 billion people (more 
precisely, 5.85) for 2000, which later turned out to be fairly accurate. ® <a 
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>>> Application: the good old slide rule ... (Fig. 6.18) 

The slide rule is based on logarithmic scales and allows the mechanical optical 
completion of basic arithmetic operations (and more). For example, in order 
to perform the multiplication 2 x 3, one pushes the 1 of the movable “tongue” 
over the 2 on the scale below, and moves to the 3 on the display. Thus, one 
has added the logarithms of 2 and 3: log2 + log3 = log6. On the lower scale, 
we naturally read 6. 


saad 
anisto @ 
Fig. 6.18 2 x 3 = 6, carried on slide 


® Remark: Until the invention of the pocket calculator and the widespread use of 
PCs, slide rules were indispensable in school, science, and technology. ® <<< 


>>> Application: double logarithmic scales (Fig. 6.19, left) 

Occasionally, you will also have diagrams in which the values are plotted 
logarithmically on both coordinate axes. Interpret the two graphs in Fig. 
6.19, where the energy consumption of different animals is registered for the 


various locomotions (a... swimming, b... flying, c... running). 
Wah 
kg.km 
10 - 
5 
2- b 
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Fig. 6.19 energy consumption of (a) swimming, (b) flying, and (c) running 


Solution: 
The non-trivial graph on the left? is difficult to understand for non-mathematicians: 


? According to McMahon and Bonner: On Size and Life. Scientific American Books. New York, 
1983. 
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The abscissa shows the applied body mass. In order to account for both small 
animals of only a few grams of weight and large land mammals, a logarithmic 
scale has been chosen. 

On the ordinate axis, the energy consumption per kilogram of body mass and 
kilometer travelled is plotted. It is apparent that there is a linear relation to 
the mass when this energy expenditure is logarithmically distorted. 

For the untrained chart reader, the image on the right is just as meaningful 
without distorting the data. The relevant information can clearly be seen 
here, though one can still imagine large land mammals with a mass of 500 kg 
and more quite effortlessly: 


Fig. 6.20 running or flying? 


1. Swimming is (at least for fish — here, salmon were examined), by far, the 
most efficient locomotion. This is probably because gravity is compensated 
by buoyancy. 


2. With more than 10 kg of body mass, running on land is more efficient than 
flying. (Flightless ostriches have a mass of 50 - 150 kg!) 


3. In relation, small animals consume significantly more energy to perform 
an action. In order to keep the energy levels stable (see Application p. 85), 
a shrew has to eat high quality (animal) food of more than its own weight 
every day, while the elephant survives on inferior food (grass), with the 
daily amount equalling about 7% of its own weight. 


® Remark: Even when mammals are in “idle motion”, a lot of energy is required in 
order to maintain their metabolism. Reptiles are essentially more moderate. There- 
fore, mammals are “ready for action” at low temperatures and have more stamina 
than reptiles. Science is not yet sure about whether dinosaurs, which are commonly 
associated with reptiles, were poikilothermal or whether they were warm-blooded 
like birds , which are their only surviving descendants. Anyway, after the extinction 
of the large dinosaur species, only mammals are left in the “upper weight class” of 
terrestrial animals. ® da 
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6.3 The derivative function of a real function 


If the function graph y = f(z) has no kinks or jumps, then there is a line t 
touching the graph called the tangent at the point P(x/f()). The tangent 
can be defined as a limit position of a chord s that joins P with a “neighboring 
point” Q(a+h/f(a+h)) (Fig. 6.21): 


t= lim s=lims. 
Q>P 0 
The slope of the tangent tells us something about the shape of the curve in 
the vicinity of P. If, for example, the tangent is horizontal (slope 0), then 
the function graph has a local extremal value (minimum or maximum). 
If the tangent has the inclination angle a, then the following relation holds: 


tanar= jin 0 )= F00) 


m “ (6.6) 


This value is referred to as the derivative of the function f(x) at the point 
x and is denoted by f’(x). We have: 


The derivative f’(a) of a function f(x) at x yields the tangent of the angle of 
inclination of the tangent: f’(a) = tana. 


Fig. 6.21 chord and tangent Fig. 6.22 graphical differentiation 


If x traverses the entire definition interval, then the term f’(2) provides 
a new function, namely the derivative function. We shall soon see that for 
mathematically defined functions, the derivative function can be calculated 
directly following a set of simple rules. For empirically defined functions, one 
has to be satisfied with “graphical accuracy”, and the investigations will be 
limited to qualitative questions: Where does the function have a maximum 
or minimum? 
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The computation (or construction) of the derivative function is also called 
differentiation. The computation of derivatives falls within the scope of dif- 
ferential calculus. 


Graphical differentiation 


In all cases, graphical differentiation works within graphical accuracy (Fig. 
6.22). The derivative function y’ = f’(x) is constructed pointwise as follows: 
One chooses an arbitrary point P of f (the function to be differentiated) and 
draws the tangent line ¢ with graphical precision. If one shifts t in parallel 
through the auxiliary point U(-1/0) on the x-axis, then, in accordance with 
the definition, this straight line intersects the y-axis at a point T with height 
f'(z). The corresponding point P’ of the derivative function lies above (or 
below) P. Thus, the corresponding point on the derivative function equals 
P'(xp/yr). 


® Remark: If you select U(—A/0), then the derivative curve is superelevated. ® 


>>> Application: trends in population development (Fig. 6.23) 

Using graphical differentiation, one can analyze population trends more effi- 
ciently. Discuss Fig. 6.23 (cf. Application p. 254). The function f(x) shows 
the population development of the city center of Lima. The function g(x) 
shows that of the entire city. 


p fin Mio (Lima/Peru) 


1900 1950 2000 


Fig. 6.23 trends and reversals in population development 


Solution: 

Zeros of the derivative function indicate stagnation, extreme points (maxi- 
ma and minima), and reversals. So, you can clearly see that, between 1940 
and 1950, there was almost zero growth in both the Megacity and the city 
proper. Furthermore, the Megacity’s peak of population growth has already 
been passed around 1990, whereas the growth of the town center continues 
unabated. <<< 


>>> Application: biking in the mountains (Fig. 6.24) 
A cyclometer counts a wheel’s number of turns n, the corresponding time, and 
the altitude with an altimeter. How can it, thus, create an altitude profile 
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or slope profile of the route (regardless of whether one inserts intervening 
breaks)? 


Solution: 

Let u be the constant circumference of the wheel. In smaller intervals At, we 
measure the number n of revolutions of the wheel and the height difference 
Ay. Hence, the distance As and the average speed are 


As 


As=nu, v=— 
At 


in the interval in question. Only As > 0 matters, otherwise the biker is just 
having a break. The corresponding distance Ax in the normal projection (in 
the top view) is calculated as 


Ag = \/ (As)? - (Ay)?. 


Graphically, we find a neighboring point (a+ Ax/y+ Ay) to any point (a/y) 
on the upper curve in Fig. 6.24. Constant repetition results in a polygon 
that is an approximation of the mountain profile of the road. The smaller 
At is chosen, the more accurate the curve is. Due to the highly inaccurate 
measurable difference in height, a compromise has to be reached (measured 
approximately every 20 seconds). 

The average angle a of inclination is obtained from the relation tana = 
Ay/Az (multiplied by one hundred, this equals the slope specified as a per- 
centage). The lower curve in Fig. 6.24 is the slope profile whose ordina- 
tes are proportional to tana and whose points have, thus, the coordinates 
(x/c- tana) (c is the constant scaling factor). Therefore, it is essentially the 
derivative curve of the mountain profile. 


® Remark: Cyclists who want to evaluate “their” curves, prefer graphs which are 
very similar to the above described, but they do not have the normal projections 
of the distance drawn on the abscissa. These diagrams show the actual distance on 
the abscissa: Then, the Az-intervals are replaced by As = Ax /cosa. Yet, even for 
slopes with an angle a < 8°, we still have cosa > 0.99. Visually, the difference is 
barely recognizable. ® 


} 2 7 i “Googe 
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Fig. 6.24 Profile of a bike tour in the mountains. Detailed 3D-information about 
your own bike or those of a virtual cyclist. 


® Remark: The Garmin Edge Series is a GPS-based cycling computer that of- 
fers the possibility to race against a virtual partner. For that purpose, your own 
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track information or these of other bikers are displayed on the computer. This al- 
lows one to race against others in real time and under the same terrain conditions 
(slope, ground, etc.). The respective position of the virtual partner (lead, lag) is 
shown on the graphic display. One can, for example, upload his/her own data to a 
corresponding website and invite others to ride/compete in the virtual simulation. 
In addition, you can cycle along downloaded tracks (also with/against a virtual 
partner). Software download at www.garmin.com. © <<< 


Notations 


Engineers and physicists like to use the so-called differential notation. Instead 
of f’(x), they write 

d d 

f'(z) = —f(x) or, even shorter, f’(x) = i 

dz dx 
The main argument in favor of this type of notation is that, in the various 
applications, the independent variable is not always called x. This is often 
the time tf. In order to indicate that one does not differentiate with respect to 
x but with respect to another variable (for example t), the following notation 
has become common: 


. ad df 
p= )e ox 
dt dt 
Physical interpretations of the derivative 
In addition to the geometric interpretation of the derivative (slope of the 
tangent), there are various physical interpretations. Here are two important 
examples: 

1. The instantaneous velocity is the derivative of the path-time function. 
From physics, we know the basic formula 


Ss 
s=vu:t or aes, 


So, the speed is the quotient of distance and required time. This formula is 
valid only if the speed throughout the motion is constant — or one obtains 
the average speed as a quotient. The shorter the selected time interval is, 
the more accurately the average speed will match the instantaneous velocity. 
The latter is obtained through a limit process, and the tangent is likewise 
obtained from the limit position of a chord of a curve. It is this limit process 
that is performed when differentiating. 


If we interpret the distance as a function s = s(t) of time, its derivative v = u(t) 


is the “velocity function” u(t) = 8(t) = =. 


2. The instantaneous acceleration is the derivative of the speed-time function. 
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Analogous considerations apply to the acceleration as the derivative of the 
velocity function with respect to the time. 


If we interpret the speed as a function v = u(t) of time, its derivative a = a(t) 


is the “acceleration function alt) =o(t) = 


>>> Application: free fall 
Thanks to physics, the formulas for free fall are known: 


3 = oP, v=gt,  a=g(=9.8lms"”). 


Show that v = s and a=v hold true. 

Solution: 

Soon, we will see formulas that reduce the calculation to a single line. Ho- 
wever, in order to promote understanding of the limit process, we want to 
perform one: 

After t seconds, a body will have fallen st meters (neglecting drag). At the 


end of a “short” period of time dt, the distance is s(t + dt)” meters. In the 
time dt, the body has, thus, fallen a distance of 


ee S(teat)® SP - 5 [«? + 2tdt + (dt)? - 27] = : [2tde + (at)"] = 9 [+ I . 


meters. For the average speed v in the short time interval dt, we, thus, have 


Now, we let dt be “infinitely small”. Then, for the instantaneous velocity, we 
get v= gt. 

To compute the instantaneous acceleration: After t seconds, the body increa- 
ses its speed in the short time dt from gt to g (t+dt). The average acceleration 
(= change in velocity / time) is, therefore, 


g(t+dt)-gt gadt 
a= ao =e 
dt dt 
Here, we do not even need to perform a limit process: The acceleration is 
already independent of dt. 
Obviously, the acceleration is obtained by differentiating the path-time func- 
tion twice. Symbolically, we write a = §. <<< 
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6.4 Differentiation rules 


In a symmetrical manner, we want to derive rules that makes differentiation 
easy. 

The derivation of a simple class of functions can be calculated immediately: 
The derivative of a linear function f(a) = kx+d equals the constant value k, 
and the derivative of the constant function f(x) = d vanishes: 


(kx+d)! =k. (6.7) 


Proof: a) Purely graphically: The graph of the function is, indeed, a straight line 
with a constant slope. So, the tangent of the angle a of inclination is also constant. 
b) From the limit process, we obtain 


f(xth) f(«) 
i SN 
f'(x) = lim aa aac lim il lim k = k. 
h>0 h n=0 kh h0 


For k = 0, we have f’(x) =0. © 


® Remark: As an example, the function s(t) = vot + sq describes the distance at 
constant speed and, consequently, § = vp. ® 


Quite often, one needs the derivative of the sine or cosine function (Fig. 6.26): 


(sinz)'’=cosx, (cosx)' =—sinz. (6.8) 


Proof: We prove the first part: 


sin(a +h) -sinx 
a = 


ey eee li , 
(sin x) lim h COS & 
We use the well-known formula 
sina — sin 8 = 2cos oF sin So" 


and derive Formula (4.21) in Application p. 141. Then, we get: 


sin(v+h)-sing |, 2cos(x+ ) sin 4 7 
h h>0 h 


aie 
(sin x) lim 


sh 
sll 5 


= li ct = COS 2. 
lim cos(x a COs x 


© 


® Remark: An illustrative and easily comprehensible geometric proof of the relation 
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Fig. 6.25 (sinz)’ = cosz Fig. 6.26 sina and cosz 


could look like this (Fig. 6.25): 

We consider the unit circle and the two points P and Q on it, which correspond to 
the track angles 2 and x + dx. The arc (and in limiting cases, thus, the chord) from 
P to Q has a length of dx. We consider the z-parallel through P and the y-parallel 
through Q. The two lines intersect at a point R. At a small angle, the triangle PQR 
is “almost” a right triangle with hypotenuse dz and the angle «+dz (angle between 
the circle tangent at Q and the y-direction) at Q. The difference in height dy of P 
and Q equals RQ = dy. This value can, obviously and by definition, be interpreted 
as the growth of the function y = sinz, when « is increased by dz. Initially, we have 
the approximation, but in a limit process, it is exactly the relation to be proved: 
dy = cosa. 

Stich a proof is not very much appreciated by pure mathematicians. In technical 
books, however, it is quite common. The arithmetic of “infinitely small quantities” 
works surprisingly well. The only risk is the difficulty when it comes to identifying 


errors — one must be careful (Application p. 369). ® 


>>> Application: At what angle do the functions 
y =sina and y = cosx intersect? (Fig. 6.26) 


Solution: 
The two functions are “periodic”, i.e. they repeat on and on (period length 
27). In the interval [0,27], the graphs intersect twice: 


51 


1l=>tanz=1> 7 


sin x = cosx > SE 
COs & 4 4 


® Remark: At x1, the sine curve has a tangent with the slope y/(m/4) = cos7/4 = 
1/\/2. The angle of inclination is, therefore, 35.264°. Analogously, the graph of 
the cosine at x; has a tangent with the slope y'(m/4) = —sinz/4 = -1//2. The 
inclination angle is —35.264°. The intersection angle is, thus, 70.53°. ® <a 


The “queen” of functions 


There is — up to a multiplicative constant — just one function that does not 
change when it is differentiated, namely the Euler function y = e”: 


(e*)' =e", (6.9) 
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Proof: It must be proven that 


ath x h 
= = 
a ea 
Therefore, one has to show: 
et —] 
lim = 1. (6.10) 


Without going into the details of the proof (which would require some lemmas that 
we have not yet deduced): It is highly likely that Euler had defined “his number” e 
so that the limit in Formula (6.10) attains the value 1, namely by 


n> co 


lim (1 + -) = 2.71828183.... 
n 


Then, 
nh nh m 
e” = lim (1+ -) = lim (1+ ~) = lim (1+ “) F 


noo n N00 n moo m 


From now on, briefly: Using the binomial theorem (see p. 468), we have 


m(m- 1) | 
2 


m-1p } 


(a+b)™ =a™+ma EE oe em 


and with a = 1 and b=h/m, we infer for e”: 


from which 


Fig. 6.27 y =e” Fig. 6.28 y=cosha, y=sinhz, y=tanhz 


® Remark: This beautiful property of the natural exponential function has enor- 
mous implications for all of mathematics. One might call y = e” the “queen of the 
functions” for this reason (see Appendix B). Many other functions can be created 
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from this function, like the hyperbolic functions (Fig. 6.28, see also Application 
p. 273): 

e* +e” . ev -e* sinha e*-e* 

cosh x = oo sinh x = a tanh x = = 


: cosh e% +e” 


which we shall meet in connection with the “catenary” (Application p. 430). Via the 
detour of complex numbers, there is even a connection to trigonometric functions 
(Application p. 273). The proof can be done by means of the so-called power series 
expansion, which we will discuss in Chapter 8. @ 


>>> Application: At which angle does y = e” intersect the y-axis? 

® Remark: For all points on the y-axis, x = 0 holds. The function y = e” intersects 
the axis at the point (0/e°) = (0/1). The derivative there is also e° = 1. That is, the 
tangent has the slope tana = 1 and, thus, a = 45°. The angle made with the x-axis 
equals that made with the y-axis (Fig. 6.27). ® <<< 


Next, we show a few basic rules with which one can differentiate quite com- 
plex functions. In the following, f(x) and g(x) are two functions with the 
derivatives f'(x) and g/(x). Therefore, we have 


f(a +h) f(2) 
h o] 


/ ; / _ g(a +h)-g(2) 
x) = lim x) = lim : 
P() h=0 g (2) h=0 h 
It is immediately clear that multiplicative constants can be “factored out” 
while additive constants disappear: 


(k- ftd) =k f". (6.11) 


Proof: With F(x) =k- f(a) +d, we get 
y_y (Kk f(at+h)+d)-[kf(z) +d] _ 1. hives 
F lim i k lim k f(a). S| 
Numerical example: The derivative of the function y = 6cos7+2 reads y’ = —6sin x. 
Equally simple is the 


f(a+h) - f(a) 
h 


summation rule: (f +g)! = f' +9’. (6.12) 


Proof: 


(f4 o){(2) = tim [f(ath) + 9(x4 e] [f(x)+9(x)] _ 


p+ h oy a+ h d 
= py LEE # N= eI] + LoCo) ED] © 6 
® Remark: This formula is valid only if f’ and g’ exist, a fact which also applies in 


the cases of Formula (6.13) and Formula (6.17)! @ 
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»»» Application: Find the “first” positive maximum of f(x) = 5 +sin«. 
(Fig. 6.29) 

Solution: 

A maximum occurs if the function has a horizontal tangent, that is, if the 

inclination angle a of the tangent is 0. So, we must have 


tano= f'(2) =0. 
With the summation rule, we have 
27 


J 1 i 
f= (5) +(sinz)’ = —+cosz=0> cosz=-= >2=—. 
2 2 2 3 


In the neighborhood of the point (22/3 - 3), the sign of f’ changes. Left 
of this point, we have y’ > 0; right of it, y’ < 0 holds. Therefore, it is, indeed, 
a maximum. The curve “winds” about the line y = «7/2 (Fig. 6.29). 
A, 
f y =h4 sin x 


y=3 2n/3 X 


Fig. 6.29 f(x) = $+sing 


® Remark: In this example, it is important to calculate in radians, rather than in 
degrees. So, your calculator must be set to RAD and not to DEG. The setting 
GRAD refers to the so-called gradians (a right angle equals 100 gradians), which 
we will not use here (they are still used in surveying). The identification of the right 
angle with 90° (degrees) is due to the Babylonians who did not calculate in the 
decimal system. The latter was “exported” by Arabian mathematicians from India 
to Europe some 1,000 years ago. ® << 


Furthermore, the differentiation rule for products (Leibniz’s rule) reads 


product rule: (f-g)'=f'-g+f-g’. (6.13) 


Proof: Since F(a) = f(x)- g(x), 


teen ‘a S(@t+h)- ge +h) - f(z) -9(2) 
F(x) =(f(e)-9(2)) = lim 7 
Now we use a trick: We subtract an expression and add it again to “keep the balance”. 
By cleverly combining terms, this seemingly more complicated expression yields new 
“constellations”: 


That’s the point! 


= im LOE +R) ge th) -F (a) g(a +h) + F(z) g(a + h)-F(@) -9(@) _ 


F' 
h30 h 
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CEO ESC) 
h 


h-0 


g(a+h)+ f(x) lim see) =a) =f'gt+ fg. © 


Using the product rule, we can prove the following basic differentiation rule 
for the derivative of the power function y = x”: 


(a”)'=n-a"™! (6.14) 


Proof: We first assume that n is a natural number. Subsequently, we will prove 
the assertion for arbitrary real numbers, see Formula (6.25). 

The proof uses the method of “mathematical induction”: We already know that, for 
n= 1, we have (#1)! =1-2x° = 1 (“induction base”). Now let us assume the theorem 
is true for some n. If we can show that the theorem is also true for n+ 1, then it 
is valid in any case. So, we suppose that (a”)! = n-a2"-!. We have to show that 
(a”*!)' = (n+1)-2”. This is done by using the product rule 


(x1)! = (a@- a”) =1-(2")+a-(n-2™!) = 2" +n-2” =(n4+1)-2”. © 


>>> Application: bomb trajectory 

A parabola has the equation y = ax? +ba+c. Determine the coefficients a, 
b, and c so that the throwing distance is 20m and the throwing height is 8m 
in a horizontal terrain. How large is the launch angle a? 

Solution: 

The origin lies on the parabola (0 =0?a+0b+c > c = 0) for the sake of 
simplicity. For reasons of symmetry, the apex must have the coordinates 
(10/8). Therefore, 8 = 10? a+10b+0 > b = 0.8-10a. Furthermore, the tangent 
must be horizontal there, i.e. y’/(10) = 0. Applying the previous rules, we get 


y =2ax+b. 


Therefore, we find 20a+b=0 => 20a+0.8- 10a =0 > a=-0.08 > b= 1.6. 
Further, y’(0) = tana = 1.6. So, a launch angle of a = 58.0° is required. << 


The next rule is called the quotient rule. The proof uses the product rule and 
the differentiation rule for the power functions: 


fy fig (6.15) 


quotient rule: ( ; 
g g 


>>> Application: Calculate the derivative of the function y = tanx and 
show that its graph intersects the x-axis at 45°. 


Solution: 
® Remark: The function tanz can be written as a fraction: 


sin x 
tan x = 


cos x 
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Now, we can apply Formula 6.15 and find 


coszcosa-sina(-sinz) cos?z+sin? x 


(tan x)’ = 


cos? x cos? x 
Depending on whether we split the fraction into its summands or we use cos? # + 
sin? x = 1, we obtain 
(tana)’=1+tan?a or (tana)! = (6.16) 


cos? x 


The function tanz is again periodic (with period 7). The zeros arise from the 


equation tan x = 0 and are, thus, x =0, +7, +27, .... At x = 0, the derivative equals 
(tana)’(0) = 1, ie. the tangent is inclined at a = 45° (tana=1). ® <<< 


>>> Application: Witch of Agnesi (Fig. 6.30) 
Consider the function y = 2/(1 + x”). Show that the point (0/2) is a maximum 
and the tangents at the points (+1/1) subtend angles of +45° with the x-axis. 


Fig. 6.30 Witch of Agnesi (bell-shaped curve), and not a real “bell” 


Solution: 
® Remark: Applying the quotient rule, we find 
,_ 0-(1+2?) - 2-2 4x 


(1+ x?) ~(1+22) Sy re. 


y 


For x < 0, we have y’ > 0. That is, the curve is monotonically increasing until «x = 0, 
where it has a maximum. 
For « = +1 (= y= 1), we have y’ = tana = #1 > a= ¥45°. ® 


® Remark: The curve is bell-shaped, but should not be confused with the famous 
Gaussian bell curve (Fig. 8.4). ® <<< 


For “nested functions”, we need the so-called 


/ 


chain rule: (fl9(x)]) f'[9(z)]-9'(z). (6.17) 


The term f’[g(a)] is frequently called the “exterior derivative”. The expres- 
sion g’(x) is called the “inner derivative”. Technicians or physicists prefer the 
differential notation 
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Gos 


= . wl 
dx dg dx ot) 


Proof: The differential notation is actually an abbreviation for the following limit: 


(rote) = jn Loe I= fate) 
ny Liga + bY) = Foley] gle +h) - g(a) _ 
Ib glee h)~ 92) : 


Flg(@)|-9"@)- gy 


Examples are the best way to memorize the chain rule. 


>>> Application: some derivatives 
Determine the derivative of the functions 


sin x = 2 = _ e-e* 
, y=cos(3a* +1), y=tanhz= 


22 
=e =e Sr 
¥y > Y et 4 e-2 


Solution: 


y =e" => y! =e?" 2 = 2y, 


A r a 
y = ene => y 2 eine z COs 2, 


y = cos(3a7 + 1) > y’ = [-sin(3x? + 1)]- (6x) = -6a sin(3z? +1), 
7 ev —et eae (e* eet et +e") _ (e* ae Ne ae) 


er +e” (e?¥ +e°7)? 


= (tanh 2)’ = 1- tanh? z. (6.19) 


ddd 


In order to differentiate the inverse function, we use the following expression: 


! 1 
f") @)=a— 6.20 
(=F (6.20) 
Its main application is the elegant computation of derivatives of new function 
types. 


Proof: If g is the inverse function of f, then the corresponding graphs are symmetric 
with respect to the first median. If the tangent to f has the inclination angle a, 
then the symmetric tangent to g has the complementary angle of inclination, and 


we have i 
T 
tan(—- a) =——. 
ce ( 2 a) tana i) 
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>>> Application: derivative of the arctangent (Fig. 6.31, left) 
The inverse of the function y = tan x is called the arc tangent — arctan y (“the 
arc of the tangent”). Compute the derivative of y = arctan z. 


y=tan x 
¢ 
4 
‘ 4 
2 y=arctan x 
Za 


Fig. 6.31 arctan and arcsing 


Solution: 


We have tanu =z and (tanu)’ = 1+ tan? u, and therefore, 


1 
L+a2 


(6.21) 


(arctan)’(tan uw) = (arctan)'(2) = 


® Remark: Analogously and according to Formula (6.19), the derivative of the 
inverse function of tanh is 
1 


(artanh)’ (a) = Toa?’ 


(6.22) 
This function is called the “area hyperbolic tangent”, and it is frequently used in 


integral calculus. ® <<< 


>>> Application: derivative of the arcsine (Fig. 6.31, right) 
The inverse of the function y = sinz is called the arcsine (“the arc of the 
sine”). Calculate the derivative of the arcsin function. 


Solution: 


We find { ( ; 
(aresin)’(sin wu) = = 


(sin u)’ cosu /] sine 


Now, we set sinu = x and find 
1 
2 


1-2. 


(arcsin x)’ = (6.23) 


® Remark: We may reflect the sine curve only in the interval [0, 27]. Otherwise, 
y = arcsing is not one-to-one. ® daa 
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>>> Application: derivative of the logarithmic function 
The inverse of the exponential function y = e” is the natural logarithm (loga- 
rithmus naturalis). Compute the derivative of y = Inz. 


Solution: 
We compute 


(Iny'(e")= a= =. 


(e")! eu 


Now, we set e” = x and arrive at the simple formula 


(Inz)! = a (6.24) 


ddd 


Using Formula (6.20), we want to prove the 


power rule: (#”")/ =n-2""' (valid for all n€ R). (6.25) 


Proof: According to Formula (6.20), we can also differentiate the “root functions”, 
since the inverse function of the power function y = x” is the root function 


y= in = Va. 
By the theorem on the derivatives of the inverse functions, we find 
(vary = 


We substitute u” = x and apply the rules for power functions: 


1 


n-yur-t 


ur tag agli 
Therefore, we have 
4 1 1oa_ 
(x*) = (Vm)! = —_ = =a"! 
nx n 


1 

® Remark: We do not need to learn this rule by heart: If we set — = m, then the 
n 

formula takes on the well-known form 


1 1 


(2) =m-2"" with m=-. @ 
n 


By means of the product rule, one can easily show that this formula also holds true 
; 1 
if one sets m = — = k—. So, the rule applies to all rational numbers — and, since 


n n 
each real number may be approximated with arbitrary precision by a fraction, even 
for all real exponents (for n = 0, because ° = 1 is constant, the derivative is 0). © 
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® Remark: Relatively often, one has to differentiate y = \/z = ae. By Formula (6.14), 


we find y’ = 5 -g73 = Je We want to keep this in mind: 
ar pe 6.26 
Bf ae) 
® 
>>> Application: hyperbolic functions (Fig. 6.28) 
Using the function e”, the so-called hyperbolic functions are defined: 
Let ee +et 
sinh x = — cosh z = 5° (6.27) 


Show that (sinh)! = cosha and (cosh)! = sinh. 


Solution: 


2 wy! 1 
(< : )=5e Ca) = eshe: 
1 


(— 
2 


® Remark: By means of the imaginary unit i (i? = -1) (see Section B), one is able 


‘) 5 (e” #e°*(-1)) = sinh. 


to obtain quite analogous formulas for the derivative of the ordinary sine or cosine 


function: 
4 e l® elt + e ’® 
sin x = —————, cosx = ————. 6.28 
a” 2 eee) 
Again, you can easily show the already known relations (sin 2)’ = cos and (cos)! = 
—sinx. ® aa 


>>> Application: derivative of an exponential function 
Show the following formula for the derivative of general exponential functions: 


(oy =Ine- a". (6.29) 


Solution: 
Obviously, a = e!™%, and thus, 


xz 
os (m2) eee _, (q?)! = e™@? Ina =a? - Ina. << 


If the nesting of functions becomes more complex, the chain rule must be 
applied repeatedly. Even for that, we should do some exercises. 


>>> Application: more examples using the chain rule 
Through repeated application of the chain rule, determine the derivatives of 
the functions 


y= ent y=In(1+e”), y=cos(3sinz?+1), y= arcsin(cos2z). 
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Solution: 
y= fin 2x = y’ = sin 2a -cos2n- 2, 
1 


oe oD. 
1+ e2 


y=In(1+e”") >y' = 


y = cos(3sinz? +1) > y’ = -sin(3sinz? + 1)- (3 cosa”) - (22), 
1 
V1 - cos? 2x 


>>> Application: harmonic oscillation (Fig. 6.32) 

We move a point P with constant angular velocity w on a circular orbit 
(radius r) and consider its normal projection Q onto the y-axis. The projected 
point Q oscillates harmonically. If vy is the polar angle of the point P and t 
is the elapsed time, then the y-coordinate of Q equals 


y = arcsin(cos 2x) > y’ = - (sin 2x) -2 = -2. << 


y=rsing=r sinwt. 


Find the instantaneous velocity and the instantaneous acceleration of Q. 


Solution: 

The path of Q is given as a function of the time t. Consequently, we need to 
form only the “first and the second derivative” with respect to time (y and 
¥). Once the chain rule is applied, we find 


y=rsinwt > y=rw coswt > j= rw (-sinuwt). 


Thus, we see that the orbital speed and acceleration of Q vary harmonically. 
The instantaneous velocity increases linearly with the angular velocity of P. 
However, the acceleration increases with the square of w. 


Fig. 6.32 harmonic oscillation 


® Remark: The orbital speed of the point P is v = rw. The circumference of P’s 
orbit is U = 27r. This is the orbital period of P: 
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which is the time that the point Q needs to carry out a period of the harmonic 
oscillation. 


2 
In practice, this time is often given. From this, the angular velocity w = 7 follows 


and one obtains 


: . ant 
y=r sinwt =r sin —, 
ale 
. Qa rT Qrt Qrt 
y=rw coswt = cos =U COS ; 
L Tf oi 
2 4n?r . Ont _ Ar? 


y=-Trw sinwt = 


sin y. 
T? ou i he 
The orbital speed and the acceleration vector of Q are the normal projections of the 


corresponding vectors of the point P onto the y-axis. The centripetal acceleration 
An? 


of the point P is obtained with the expression <<< 


>>> Application: At what moment do the two shafts of a universal joint 
have the same angular velocity? (Fig. 6.33, Fig. 5.67) 


Solution: 
® Remark: In Application p. 219, we derived a formula for the output angle § for a 
given drive angle a of a universal joint (where y is the constant angle between the 


two axes): 


1 tana 
tan 3 = tana => 6 = arctan : 
cos Y cos Y 


We assume that the drive axle rotates with constant angular velocity w. After time 
t, the drive axle has rotated through the angle a = w-t. Then, for the output angle, 
we have 


tan wt 


y(t) = arctan with c=cosy. 


d 
The derivative y = = is now the instantaneous change of the output angle dy in 


time dt. Therefore, it is the angular velocity of the output shaft. We want to examine 
only the ratio of the two angular velocities, and therefore, we set w = 1. 

Now, we compute the derivative y according to the chain rule. Formula (6.21) is used 
for the derivative of the arctangent, and Formula (6.16) is used for the derivative 
of the tangent: 


tant\’ /tanty’ 1 1 2 1+tan?t 
y = {arctan )-( \e --(1+tan*t)=c ; 
e ( c c (eet) c ( ) c2 + tan? t 
c 
a ; . 1 1 ; T 
At the initial point t = 0 (a = 8 = 0), we have y(0) = - = . At time t = = 
Cc cosy 2 


(a= B= 7 = 90°), we have WD) = c= cosy, because the fraction 


1+tan?t _ rane ae 


c2+tan?t c 
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a w(t) 


=45°(c = cos 45° = 0.7071) 


Fig. 6.33 the drive angle and ratio of the angular velocities 


T 
converges towards 1 for t > 5 (=> tant > oo). 


Now, we can calculate when the instantaneous angular velocities are equal: We have 
to obtain y = 1: 


14+tan?t 
Cc 
c2 + tant 


=1>c+ctan?t=c?+tan?t = tan” ¢t(c- 1) = c(c- 1) 


tana 1 
> tant=tana=4+/c=>tan@=+4 = 


c Je 


Numerical example: y = 45° > c= cos 45° = 0.7071 (Fig. 6.33). 

The ratio of the angular velocities in the starting position a = 0° takes the value 
y(0) = 1.4142. For a = 40.06°, 6 = 49.94°, it equals 1, and for a = 90°, it decreases 
to (5) =0.7071. © <<< 


>>> Application: piston speed for gasoline engines (Fig. 6.34) 

Analyze the speed ratios of the given slider-crank mechanism. 

In Application p. 139, we have determined the distance y of the piston pin 
in the gasoline engine as a function of the rotation angle a of the crankshaft: 


y=rcosa+Vs?-r?sin7a. 


During some rotations with constant angular velocity w, the angle a can be 
assumed to be proportional to the time t. Thus, 


Qa=wt. 


Therefore, we have 


y=r cos(wt) + \/s?—r?sin?(wt). 
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Fig. 6.34 velocity and acceleration analysis for gasoline engines 


Since we want to analyze qualitatively, we may set w = 1. The first derivative 
with respect to t provides the instantaneous speed of the piston: 


2r? sint cost ( r cost 
———— = r sint | ———__—_.- 1]. 
QV 82 — 2 sin? t s?—r2 sin? t 
Therefore, for sint = 0 (t= a = 0,7, 27,...), ie. at the highest and lowest 


position respectively, the piston stands still for a moment. From this fact, 
one easily deduces that y can have no other zeros. Then, it would need to be 


y= -rsint+ 


r cost 
2 


-1=0 
82 —r2 sin? t 


which is never the case if s>r. 


Fig. 6.35 crank mechanism with a hydraulic saw 


It becomes more interesting when the piston has maximum velocity. There, 
the acceleration has to change sign so that it equals zero. To this end, we 
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would have to differentiate again and then set y = 0. In this specific case, this 
is already a very arduous task. 

Fig. 6.34 shows the graphs of the functions y, y, and y, which were created 
with the program otto_analysis.exe. Only the equation y = y(t) was ente- 
red. The computer can “automatically” differentiate and also finds the zeros 
of these functions, as we will discuss soon. 


® Remark: Slider-cranks allow an interplay between an up-down motion and a ro- 
tation. Therefore, they are used whenever a transmission between a rotation and a 
harmonic oscillation is to be performed. One can drive the blade of a hydraulic saw 
via a water wheel (Fig. 6.35). The technology of the saw developed by Leonardo da 
Vinci in detail is still used today. ® <<< 
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6.5 Differentiating with a computer 


Differentiating is — as we have seen — a relatively simple process, because only 
a few rules are needed and they can be applied systematically. However, when 
differentiating repeatedly, the results are often quite complex. Proceeding to 
make exact calculations with such results (like in Application p. 276) often 
requires a lot of mathematical experience. If these results seem to be too 
complex, we will sensibly use a computer to carry them out. 

A computer “can” differentiate if the equation y = f(x) is an explicitly known 
function. We distinguish between the exact differentiation that occurs with 
algebraic methods and approximate differentiation. Both methods are of gre- 
at importance. 

When approximately differentiating, one can make use of the formula 


_& | f(zte)-F@) 
~ dx € 


f"(2) (6.30) 
with a “very small” ¢, for example, ¢ = 10-!?. However, this involves the 
risk that a “loss of precision” occurs, because although a computer is very 
accurate, it may not be sufficiently accurate for such tiny differences. 

The formula can be modified, and then, it generally delivers much more stable 


results: 
_ df. flee) - flee) 
dx Qe ; 


f(x) (6.31) 
Now, a comparatively “much larger” ¢ is sufficient, approximately ¢ = 1077, 
in order to achieve an equally good approximation. Consider the following 
example: 


>>> Application: loss of accuracy with numbers dwindling 

The exact derivative of the function f(z) = tanz is the function f(x) = 
1+tan? a. The value f’(0.5) shall be calculated precisely or with the stated 
approximations given in Formula (6.30) or Formula (6.31). 


Solution: 

The “exact” value is: f’(0.5) = 1.298446 4104095. The approximations for 
e= 10-7 are f’(0.5) » 1.298 446 481 154 and f’(0.5) » 1.298 446 410333. For- 
mula (6.30) is accurate approximately to only 7 decimal places. However, 
Formula (6.31) is accurate to 9 places. Yet, if one believes that greater ac- 
curacy may be achieved by decreasing ¢, one is mistaken: For ¢ = 107!°, the 
approximations agree only to 5 or 6 decimal places. This gets worse for even 
smaller ¢: For ¢ = 107!8, only 3 decimal places are exact, and for « = 10° 
only 2. For e = 107!°, both cases result in 1.30, and for ¢ = 107'®, “the system 
collapses”, which means that the results are completely meaningless. 
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Specifically, Formula (6.30) with ¢ = 10° (still only 7 decimal places) and 
Formula (6.31) with ¢ = 10~® (10 decimal places) yield the best results. <«« 


Application p. 279 and also the following example show that one should not 
blindly trust a computer. It does not compute with “arbitrary precision”, and 
losing numerical precision will lead to strange results! 

The second derivative can be calculated approximately from the first: 


fere)-f"(w~2) 


F"(@) = (f'@)) » > 


(6.32) 


We point out that such approaches are numerically not very reliable. In fact, 
the choice of € is very critical, and sometimes it depends on the nature of the 
function. An alternative to Formula (6.32) is the easily derivable formula 


f(a) » [f(at+e)—2 f(x) + f(e-e)]fe?. 


Here you can see that the square of ¢ appears in the denominator and should, 
therefore, not be chosen too small. 


>>> Application: strong loss of accuracy in the second derivative 
Calculate the exact second derivative of the function f(a) = tana and com- 
pute f”(0.5) exactly or with the given approximation Formula (6.32). 


Solution: 


We have f’(x) = 1+ tan? «, and by differentiating once again according to 
the chain rule, we obtain 


f(z) =0+ 2tanxz - (1+tan? 2). 
— —— 
ext. der. inn. der. 


The “exact” value is (0.5) = 1.418 689014. The approximation with e = 1074 
equals f’(0.5) * 1.418 689069 and agrees up to 6 decimal places. If ¢ = 10~°, 


the approximation agrees only up to 3 decimals. For ¢ < 107°, the result is a 
collection of irrelevant digits. <<< 


6.6 Solving equations of the form f(x) = 0 281 


6.6 Solving equations of the form f(x) = 0 


The intersections of a function graph y = f(x) with the z-axis y = 0 are 
called zeros or solutions of the equation f(x) = 0. Such solutions can only be 
calculated in special cases with formulas. In general, one has to be satisfied 
with approximations. Computer algebra systems (CAS) have modules with 
which one can calculate the solutions with arbitrary precision. 


"A 


(xi/yi) 


(vi+1/yit1) 


Fig. 6.36 zeros by polynomial approximation 


A given interval [a, 6] has to be specified in which the solutions are to be 
sought. In principle, for a sufficiently large number of equally distributed x- 
values x; € [a, b], the function values y; = f(a;) are initially calculated. If the 
signs of all y; are equal, then it is assumed that there is no zero in the interval 
[a,b]. In order to avoid missing any possible solution, one will — in the era of 
fast computers — test dozens or hundreds of values x; (see Application p. 42). 
If there is a change in the signs of the function values in the interval [x;, j41] 
(Fig. 6.36, right), then a zero has to occur between x; and 2;,1. If the z-values 
are close enough, one can replace the curve by the straight line spanned by 
the points (x;/y;) and (aji41/yi+1) and quickly find an approximation x9 for 
the zero: 


(*) +t bee = *) = @ > 19 =U, + (vist _ x) a ae Yi 


Yr ~ Yi Yield — Yi 


Only in windfall will yo = f(a) now be exactly zero. If |yo| is still too large, 
one has to repeat the process of the linear approximation to improve the ap- 
proximation of the zero. Now, use (xo/yo) as the first boundary of the newly 
calculated interval. The second boundary of the newly calculated interval is 
that point (either (a;/y;) or (%i+1/Yi+1)) which is located on the other side 
of the abscissa compared to the first point. 

If the function in the neighborhood of the root is “harmless” — for instance, 
it may not oscillate strongly —, then the following approximation method 
named after Newton converges very quickly: One chooses an initial value 
xo, to which the point (xo/yo) corresponds. In a first approximation, the 
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Fig. 6.37 zeros according to Newton 


curve is replaced there by its tangent whose slope is given by the derivative 
f'(20) = yo. If we intersect the tangent, instead of the curve, with the x-axis, 
we get a new value 7: 


1 
oo +t{ ,J= oi = 0 =219- ~. 
Yo Yo 0 Yo 


Most of these new values will be better approximations for the zero and it 
takes two or three repetitions of the operation (iterations) to quickly approach 
the zero. Yet, it is also possible to have bad luck and leave the interval [a, b] 
(Fig. 6.37). In this case, a different initial value has to be chosen, and one 
hopes — often rightly — that the process is so stable that it will already 
“stabilize” at the next iteration. Then again, if there are several zeros, it will 
not always be easy to find the right one. 

In practice, one can mix the two methods successfully: By means of the poly- 
gon method (linear approximation), a zero can be detected quite accurately. 
In small intervals, the very efficient Newton method will find the right zero 
with high probability. If not, the interval is to be reduced further. 


>>> Application: filling liquid (Fig. 6.38) 

One liter of water is poured into a hollow hemispherical bowl with an inner 
diameter of 20cm. How high is the water? 

Solution: 

The formula for the volume of a spherical cap 


V= ae (ar -h) (6.33) 


will be derived in Application p. 411 by means of Cavalieri’s principle (h is 
the height of the spherical segment). With r = 10cm and V = 1,000 cm?, we 
have 


h? 
1,000 = “> (30-h) = f(h) = 7h? - 30h? + 3,000 = 0. 


The equation has only one solution in the interval [0,10], namely h = 6.355. 
So, the water is about 6.4cm high. <<< 
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Fig. 6.38 spherical bowl with liquid Fig. 6.39 submersion of a sphere 


>>> Application: submersion of a sphere (Fig. 6.39) 

How deep does a ball plunge into water (density 9 < 1)? 

Solution: 

This example is similar to Application p. 282. For the sake of simplicity, we 
can set the radius of the sphere to be 1. The immersion depth ¢ is given 
relative to the radius. 

The volume of the sphere is, thus, V = 2, and the ball displaces the volume 


# 
Vw = “0 = t). 
Balancing buoyancy and weight force by equating them, we obtain 
An nt? 
— =—(3-?). 
aa le 


Therefore, we get 
f(t) =t?- 30? +40=0. 


® Remark: With 0 = 1-S;, the trivial solution equals t = 2, because f(2) = 0 (the 
ball sinks completely). If @ = 0.5, obviously f(1) =0 = t= 1. The immersion depth 
of the ball equals its radius, i.e. half the diameter. Fig. 6.39 illustrates the situation 
for @ = 0.63. In this case, t » 1.175 is the only solution in the interval [0,2], that is, 


the height of the protruded ball cap is about 5/6 of the sphere radius. ® <<< 


>>> Application: “folding” a circular sector into a cone of revolution 
From a circular sector of radius s = 10 cm, the “coat” of a cone of revolution 
is to be folded such that the volume of the cone equals 270cm?. Calculate 
the central angle of the circular sector. 


Solution: 
Let a be half of the angle of aperture of the cone of revolution. Then, for the 
radius r, the height h, and the volume V of the cone, we have 


meh © 3.2 
=> 8° sin” acosa. 
3 3 


r=ssina, h=scosa>V= 
Specifically, we have, therefore, 


= 10° sin? a cosa = 270 = sin? acosa = 0.25783. 
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2 


With sin? a + cos? a = 1, we have 


sin? aV 1 - sin? a = 0.25783 = «V1 — x = 0.257831 (with « = sin? a) 


or, after squaring both sides, 


a°(1— 2) — 0.257831? = 0 > f(x) = 2? — x? + 0.0664768 = 0. 


Fig. 6.40 folding and unfolding ... 


This equation has three solutions for « = sin? a, of which only the positive 
ones are relevant: 


21 = —0.232265 <0, x2 = 0.310506 x3 = 0.921759. 
We also have two real solutions for a: 
sin? a1 = 0.310506 => ay = 34.18°, sin? a2 = 0.921759 => ag = 73.76°. 


On the other hand, for the central angle w (measured in degrees), the circular 
sector — according to Formula (4.18) — yields 


w° = 360° sina. 
The two solutions for w are, therefore, 


w? = 202.2°, w§ = 345.6°. «a 


>>> Application: precisely timed simulation of a planetary orbit 
According to the first two of Kepler’s laws, the orbit of a planet P around 
the Sun S' is an ellipse with S as a focal point. The orbit is traced with 
permanently changing speed, but “equal areas are covered in equal periods 
of time” (Fig. 6.41). As simple as this theory may sound, it is still impossible 
to describe the elliptical orbit by means of elementary functions such that it 
is “parametrized by time”. 


6.6 Solving equations of the form f(x) = 0 285 


| 
Fig. 6.41 Kepler’s second law Fig. 6.42 virtual planetary orbits 


Kepler solved the problem by examining the two virtual planets besides the 
planet in question (Fig. 6.42): The “middle planet” A and the “eccentric 
planet” E. A moves on the same orbital ellipse c as P (center M, semi-axis 
lengths a, b), but with constant angular velocity — where A and P have the 
same total orbital time. The eccentric planet F traces the circumcircle co 
(center M, radius a) of the orbital ellipse c as a continual companion of P on 
a line parallel to the ellipse’s minor axis (Fig. 6.42). Thus, £ results from P 
by multiplying its distance to the principal axis by the constant factor a/b. 


Fig. 6.43 “stroboscope recording” Fig. 6.44 Galilei and his telescope 


Obviously, it is sufficient to calculate the course angle a of F and then mul- 
tiply P’s distance by the reciprocal factor b/a. Kepler deduced a relation 
between the unknown angle a and the course angle t of A which is propor- 
tional to the elapsed time. “Kepler’s equation” reads 


b\2 
f(a)=a-esina-t=0 with e=/1-(-). (6.34) 
a 
The numerical eccentricity ¢ that governs the shape of the ellipse is given 
in astronomic tables. When specifying the time t, a can be found with our 
method as the only real root of Formula (6.34). 
® Remark: Fig. 6.43 shows a number of positions of a planet for equal time intervals. 
The major axis of the orbital ellipse is the apse line. On this line, we find the 
perihelion (position nearest to the Sun) and the aphelion (position furthest away 
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from the Sun). Higher orbital speeds can clearly be observed near the Sun. For our 
Earth, the numerical eccentricity ¢ 60 is pretty small and the elliptical orbit is 
almost circular. Nevertheless, the winter season in the northern hemisphere is at 
least six days shorter than the corresponding summer season. In the annual calendar, 
this fact is compensated by the following agreements: 

1. February has only 28 days. 

2. Both July and August have 31 days. 

3. Autumn begins two days later than the other seasons. daa 


>>> Application: power function and exponential function 
When does the exponential function y = 2” “overtake” the power function 
y=x° (Fig. 6.14)? 


Solution: 
In the moment of “overtaking”, the following holds: 


9 ao? => f (2) Ss 2 2 S01. 


The two solutions are x; = 1.1773 and x2 = 22.4400. The value (2)° = g(a) 
is roughly estimated via 2!° » 10%, and it is pretty large, namely » 27-4-10° » 
5- 10°. 

® Remark: We have already said that any exponential function can overtake even 
the “mightiest” power function. Looking for the “point of overtaking”, one soon 
encounters numerical difficulties. For example, the method fails already in the com- 
putation of the zeros of f(x) = 2” - 2° = 0, though f(x) = 2” - 2° °® = 0 is still 
solvable (the zero a2 * 29, then, corresponds to the value 279 *0.5-101°).@  ««« 
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>>> Application: the problem with lilies ... 

Water lilies, which grow in a pond, increasingly spread across its surface. Say, 
for example, the area that they occupy grows by 10% every day, and it will 
only take 30 days until the entire pond is covered. How large was the initial 
area, and when exactly was half of the pond covered? 


Solution: 

Let S be the surface area covered by the water lilies on their first day and A 
the surface of the entire pond. On the second day, the lilies would cover an 
area of kS with k = 1.1 (increase of 10%), and on the n-th day, they would 
cover an area of k”S. From k°°9 = A, it follows that 9 = A/k®° ~ A/17.5. 
This means that, at the beginning, the pond was covered by less than 6%. 
Now we have to calculate the value of n from k"S = 0.5A: 

k"S = 0.5-k29S > k?-89 = 0.5 > n = 30+ log0.5/logk » 22.73. 

It would seem that the pond was covered exactly halfway on the 23rd day. 


Fig. 6.45 a near-closed area of reeds on the Neusiedlersee (Austria) 


® Remark: The problem of water lilies is typical of exponential growth phenomena 
in general, which, in nature, are manifested most dramatically in the reproduction 
of bacteria. Such growth is forced to collapse eventually in one form or another 
due to a limited amount of resources being available. In the case of the pond, it is 
especially easy to see why, as it makes no sense to cover it with another layer of 
water lilies (Fig. 6.45). @ <<< 


>>> Application: population statistics 

Let f(a) be the proportion of the population of a country which is only x 
years old (f(a) €[0, 1]). This function is described by means of the illustra- 
tion in Fig. 6.46. 
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Fig. 6.46 proportion f(a) of the population at the age of x in Austria (2003) 


Solution: 

If the age classification is only accurate up to one year, then f(z) is not a 
smooth function, but a “staircase function” (“bar charts” with a bar width 
of 1). In “developed countries”, the proportion of the population f(a) decre- 
ases anywhere from 80 and onward: f(100) is already almost zero — even in 
nations where “many” people are more than 100 years old. Therefore, one can 
limit the maximum human age to x 100 years. It is striking that, in a typical 
European country such as Austria, the number of newborns decreases conti- 
nuously. In a typical developing country, this is reversed. In these countries, 
however, the mortality — also and especially at a young age — is much higher. 
The proportion of people who are i years old equals f(z). Since each person 
has exactly one age, we have 


yA) 21. 
1=0 


The area under the function graph has, therefore, the value 1. In the diagram, 
the ordinate is extremely superelevated, because otherwise, we would not be 
able to accurately see anything and the diagram would be useless. <<< 


>>> Application: reconstruction of date and time 

The vector equation (5.52) for the light ray corresponding to the date a (a 
measures the number of days from March 21) at the time z allows us to 
calculate a and z if the (normalized) direction vector So(sz,5,,5z) pointing 
towards the Sun is given together with the latitude y. First, we calculate the 
auxiliary variables w and yu as 


Sx sin? w 
w = arctan ——, p=arctan 
Sz COS — Sy Sin (py ba 


a (6.35) 


Then, z and a are as follows (a is ambiguous and corresponds to the an- 
gle a®): 


z=12- 5°” ay = arcsin a5 = 180° - ay. (6.36) 


ai. 
23.44°’ 


ddd 
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>>> Application: monetary value at various times 

Three people are making an offer for a house: A will immediately pay 200,000 
€, B wants to pay 100,000 € as a first instalment and procure 120,000 € in 
4 years, C proposes 230,000 € in 3 years. Which offer is best for the seller if 
we calculate an annual interest rate of 5% and 3.5% respectively? 


Solution: 

We relate all monetary values to the time “4 years ahead”: A 5% interest 
rate will then yield the following amounts for the three respective offers (in 
100,000 €-units): 

A=2.0- 1.054 » 2.43, B =1.0- 1.054 + 1.2 = 2.415, C = 2.3-1.05! = 2.41. 

The three deals are almost equal, but A offers the most to the seller. With 
an interest rate of only 3.5%, the seller can expect to get: 

A = 2.0- 1.034 » 2.25, B = 1.0- 1.034 + 1.2 = 2.33, C = 2.3-1.03! = 2.37. This 
time, the margins are a bit larger and C is the best bidder. <<< 


>>> Application: cold coffee (Fig. 6.47) 

If we are in a hurry in the morning, we may drink our coffee in one gulp 
immediately after pouring it into a cup. To avoid burning our mouths, we 
might add some cold milk from the fridge. However, if we decide to let several 
minutes pass between pouring the coffee into a cup and drinking the “melan- 
ge” after adding the milk, the following question arises for connoisseurs who 
wish the mixture to be still as warm as possible: Assuming that we add the 
same amount of milk in any case, should we pour the milk into the coffee 
right away or just before drinking? 


Fig. 6.47 cold or warm milk? (right stereo image, see also Fig. 5.50) 


Solution: 

We have already discussed the cooling process in Application p. 251, where 
we have computed with temperature differences. Let k be the temperature 
of the black coffee in the beginning, and let m (m < 0) be the temperature 
of the milk. The quantity of milk is a fraction q of the amount of coffee. 
According to the law of energy conservation, the temperature difference of 
the “mixture” is 

Aw k+q m 
1l+q 
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The latter decreases after t minutes by a factor of = \' (with \< 1=> p< 1) 


to 
k+qm 


Ay =pA= : 
t= ee, 


If we mix after t minutes, then we have 


Tee (uk) + qm 
l+q 


Then, we form 
qm 


At - At (u- 1). 

Since m < 0 and xz -1< 0, this expression is positive, which means that the 
“mixture” has cooled less. The difference is more noticeable the warmer the 
room (because then |m| is greater) or the more milk is added. The term is 


~14q 


obviously independent of the initial temperature of the coffee. 


® Remark: On the right in Fig. 6.47, one clearly sees the so-called focal curves. These 
are parts of curves that are enveloped by all reflected light rays. See the section on 
envelopes. ® <<< 


>>> Application: spline curves 

We want to interpolate n+1 points P;(ui/v1),..-,; Pa+i(Un+1/Un+1) (so-called 
“knots”) by a “cubic spline”. We are confronted with this task, for instance, 
when we want to design smooth curves (“free form curves”) in a CAD system. 


Fig. 6.48 composition of a cubic spline from cubic parabolas 


Solution: 

In Application p. 33 in the section on systems of linear equations, we compu- 
ted a parabola of degree n on n +1 points. The higher the number of points 
to be interpolated, the higher the degree of the curve and the more oscil- 
lations occur (especially at the end points). For that purpose, the following 
procedure is frequently used in computer graphics (Fig. 6.48): 

Consider four adjacent nodes P;_1, P;, Pis1, and Pj:2 and the tangents at 
these points (initially arbitrarily selectable). Then, two neighboring points 
plus their tangents define a unique cubic parabola (since a tangent together 
with its point of contact counts for two points). Adjacent cubic arcs are 
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tangent to each other (this is frequently called ' continuity). We shall soon 
see that the tangents can be chosen such that the cubic parabolas have the 
same curvature at the transition points (this is called G? continuity). All n 
parabolic arcs are then combined to form a cubic spline. 

For the i-th parabolic arc s; from P; to Pi+1, we make the ansatz 


y=tt+ojt+at+d; (@=1,...;7): 


For each parabola, this will yield four unknowns a,;, b;, c;, and d;, that is, a 
total of 4n unknowns. The value t = 0 will correspond to P;, which already 
yields n indeterminates: 


uj =d; (G=1,...,n). 


That leaves 3n unknowns a;, bj, c;, which can be calculated if one takes 3n 
linear equations independent from each other. 
For the parameter t = 1, we will get the point P;,1: 


Vis] = Aj +b; +c; +d; > a; + 0; + Ci = Uj41 — Ui (i = 1,...,n) (6.37) 


(n linear equations). 
Now, for the tangents: We compute the first derivative 


y = 3a;t7+2b:t+¢ (i=1,...,n) 


of the cubic parabolas. We need the tangent of a neighboring parabola, for 
example, the left s;_; that exists only for 7 > 1: 


y= 3aj-1 i 2b;-1 t+ G-1. 


There, the same value must be achieved for t = 1 on the i-th parabola s; for 
the parameter t = 0: 


3aj-1 +2 bj-1 + Cj-1 = CG (i = 2: rere n) (6.38) 


which yields a further n —- 1 linear equations. 

Now for the second derivatives, which, together with the first derivative, 
determine the curvature of the curve at t = 0: 

The second derivative of the i-th parabola 


Yy =6a;t+2b; 


equals that of the (i-1)-th parabola at t = 1. With the same considerations 
as before, we get 
6aj-1 + 25)-1 = 25; (¢=2,...,n) (6.39) 


which are an additional n - 1 linear equations. Overall, we have now found 
3n-2 linear equations. The last two can be found by prescribing the second 
derivative at the start and end point, and thus, 


2b, =0, Gan +2bp =0. (6.40) 
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One can solve this (3n,3n) system of linear equations efficiently because of 
all the zeros in the determinants (the best method was invented by Gauf). It 
turns out that the computation time is only linearly (and not quadratically 
as in the general case) increasing with the number of nodes. 

® Remark: Normally, cubic splines look as we expect them to look (which is import- 
ant in the design of free-form curves). 

One should choose relatively equal distances between the control points. A dis- 
advantage, however, is that any change of a control point affects the curve only 
locally. Therefore, other spline types (for example, Bezier splines) are offered by 
CAD programs, which we will not discuss here. © << 


>>> Application: approximation of a point cloud through a straight line 
Show that the points P;(21/y1), ..., Pn(@n/Yyn) in the plane uniquely define 


n 
the linear fit r: y = kx through their “center of gravity” S(Z/Y) = 1 x P; 

i=l 
with the coefficient k = 57, (2@;-2)(yi-9)/ X11 (ai- 7)? that minimizes the 


sum of squared distances to the data points. 


Proof: Let the straight line r be given by y = ka +d (Fig. 6.49). The obvious next 
step is to move r through the “center of gravity” S(Z/y) of the point cloud, thus 
d=y-kz. 


| 


(pe 
ax 


Fig. 6.49 linear fit 


The normal distance Pr of a point P(2/y) from the line r with the angle of in- 
clination a = arctank (with respect to the x-axis) is proportional to the difference 
Ay; of all y;-values of all points P; and the corresponding points on the line r at x; 
(Pr = Ay; - cosa). So, it is sufficient to choose k in 


V(Au)? = Vly - (kai + a)? = Vly - (kai + Y- hz)? = 


= Vii - 9)? + k? (xi - 2)? - 2k(yi -9)(ai -7)] 
such that we obtain a minimum. For that purpose, we have to find the zeros of the 
derivative with respect to k. 


i[2k(xi - B)? - 2(yi - 9) (wi — B)] = 0 kN (as - F)? = (i -B)(@i - 8), 


whereby the above assertion has been shown. ©) 
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® Remark: The linear fit plays an important role in statistics (Application p. 497): 
If the deviation of the points from the line is small, one can make predictions about 
events beyond the given data. ® <<< 


>>» Application: strain on an insect wing (Fig. 6.50) 
Insects rapidly flutter their wings. Guess the maximum acceleration of the 
outer parts of the wings of the depicted hawk-moth (8 cm wingspan). 


Solution: 
The wing motion is ideally “harmonic”. The path-time diagram of the wing- 
tips is, therefore, a sine curve. 


Fig. 6.50 hawk moth in flight 


Let the wingtips have an amplitude of s. The frequency of flapping is n. 
The function y = s sin(27nt) oscillates n times up and down in the time 
interval [0,1]. So, it is sufficient to simulate the situation within one second. 
The acceleration is obtained by differentiating twice with respect to time: 
¥ = —s(27n)? sin(27nt). The maximum acceleration is, therefore, 47? sn? ~ 
40 sn’. 

In fact, for the depicted butterfly, the roughly estimated values s = lcm = 
0.01m and n = 60 result in a nearly 150-fold gravitational acceleration. Similar 
values also appear for the wings of hummingbirds. Bees have a frequency of 
about n = 200 and an amplitude of s = 2mm = 0.002m. The maximum 
acceleration for the outer wings is, then, about 300g. 


® Remark: Insects can definitely cope with larger accelerations. During their rapid 
turning manoeuvres, flies and dragonflies can stand up to 30g with their whole 
bodies. The wing parts are much more resilient because they do not contain any 
vital organs. The smaller the insect, the greater the resilience: The cross-sectional 
areas responsible for the stability relative to the weight are substantially greater, as 
we have discussed in detail in Chapter 3. Short term accelerations of 300g are no 
problem for a bee’s wing. ® <<< 


>>» Application: efficiency of a wing or fin stroke (Fig. 6.51) 
The so-called Strouhal-number S' corresponds to the frequency of a wing beat 
or flapping multiplied by its amplitude and divided by the forward velocity. 
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The highest efficiency is achieved at values between 0.2 and 0.4. This number 
is used to describe the locomotion of flying and floating animals — regardless 
of their size — so, ranging from mosquitoes to birds, bats, and even baleen 
whales.? 


~~ 3 = =—— — ———e c 
Fig. 6.51 different frequencies by huge and tiny warm-blooded animals 


For the hawk-moth in Application p. 293, we obtain 


ie. the greatest efficiency at an airspeed of about 2m/s. If an insect or bird 
is in the air (Fig. 6.51, right), the Strouhal-number assumes an infinitely 
large value, which means “zero efficiency” — at least in terms of overcoming 
distances. <<< 


>>» Application: detailed analysis of a landing flight (Fig. 6.52) 
Especially with smaller birds, the landing flight can happen in a flash, and it 
is difficult to make out any details with the naked eye. For this reason, such 
a landing flight shall be examined in close detail here by using biophysical 
methods of slow-motion analysis.4 


Fig. 6.52 landing flight of a blue tit 


The image series Fig. 6.52 was originally shot at 250 frames per second (see 
also Fig. 3.69). Pictured here and highlighted in the graph Fig. 6.53 is every 


Shttp://www.nature.com/news/2003/031016/full/news031013-9.html 
4From Glaeser, Paulus, Nachtigall: The Evolution of Flight, Springer Nature, Heidelberg 2017. 
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third frame, that is, frames number 0 — 3 — 6 etc. The interval At that has 
elapsed between one trio of frames and the next thus equals about 1/85 s. 
Since it is not possible to determine a centroid, the centers of the eyes in 
subsequent images have been drawn over each other to create a distance- 
time graph s(t). Between each trio of frames, the average landing speed 
v = As/At has been calculated by difference analysis and is plotted as u(t) 
both in meters per second and kilometers per hour. An approximate delay 
—b = Av/At(t) may also be calculated and plotted as —b(t) in meters per 
square second and in units of Earth acceleration g. 

This comparison is quite instructive (the numbers used in the evaluation are 
rounded to two decimal places, as is common in technology; the accuracy of 
the evaluation on the basis of the drawings is, of course, considerably lower). 
The blue tit lands in a slightly wavy line, which is inclined downwards at an 
average angle of no more than 15°. Shortly before frame 0, the bird makes a 
braking flap with its wing, then two more on its way to the landing site, and a 
third weak flap upon touching down with its feet (frame 18, cf. drawing). One 
would think that the landing velocity is abruptly reduced with each braking 
flap of the wings. As the graphs show, this is the case to some extent. The 
braking flaps cause fluctuations in the v(t)-curve, but the fluctuations are 
not particularly strong. These flaps may produce some kinks in the curve, 
but in general, the landing velocity decreases at a constant pace. This is due 
to the inert mass of the bird, which puts a damper on any rapid changes. 
Between frames 0 and 3, the blue tit is landing at a velocity of about 2.34 m/s 
or 8.42 km/h. Up to the moment when the bird touches down with its feet, 
that is, over a flight distance of 12.7 cm, which the tit crosses in 0.072 s, 
aerodynamic braking alone causes the velocity to be reduced to 1.21 m/s or 
4.36 km/h, that is, by half. 

In the remaining images until the bird comes to rest upon landing (frame 18 — final 
position), the work is done by the bird’s legs, which are initially stretched out at an 
angle of 145° and are gradually folded in until they hang at an angle of about 45°. 
These legs are astonishingly effective as landing devices and shock absorbers. 

The delay can only approximately be determined by means of “manual” differential 
calculations. Due to the braking flaps of the wings, it fluctuates considerably. Over 
a flight distance of 12.7 cm, that is, until the moment of the feet’s touch-down, the 
tit slows down from 2.34 m/s to 1.21 m/s within 0.072 s. The average delay thus 
equals -b = (2.34 — 1.21)/0.072 = 15.69 ms~? or 1.60 g. 

When comparing individual frames, however, the delay may vary considerably, na- 
mely between 22.10 ms~? (which equals 2.25 g) within the first trio of frames and 
41.65 ms~? (which equals 4.25 g) within the final trio. So, over this short distance 
of braking flight, the delay changes by a factor of 1.89. During its landing, the bird 
even accelerates slightly, but this may not be the case in other pictures of a blue 
tit’s landing flight. 

Upon touching down with its legs, the bird slows down further until it comes to 
rest. The deceleration forces thereby generated must cushion the impact of the 
landing legs. It has already been mentioned that the legs are bent in by about 100° 
upon touch-down. The leg extension muscles, which are already activated and thus 
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8 (distance) 
v (velocity) 
b (acceleration) 


Vmax %234mls 


Fig. 6.53 Computer evaluation of the image series at an interval of 1/250 s. Green: 
distance-time-diagram, blue: velocity-time diagram, orange: acceleration-time-dia- 
gram (mirrored on the abscissa). Every third frame (1/83 s) is numbered. Initially, 
only these frames were “manually” evaluated, but the final outcome already became 
apparent at this point. 


prepared for the strain, as well as their tendons, are further stretched in the process, 
like a previously stretched tension spring, on which further weight is attached. At 
the same time, the bird tilts clockwise around its lateral axis and thrusts its head 
forward until it comes to rest in the position drawn with dashed lines. In the final 
phase, other muscles, especially those in the back, also help to alleviate the impact 
of the touch-down. The energy absorbed in the process is released as heat into 
the environment unless it is used to change the position of the bird’s trunk. The 
deceleration in the final phase amounts to an average of 2 g. Good “shock absorbers” 
are needed to alleviate such an impact! 

The seemingly effortless landing process does not just “happen by itself,” but it must 
be precisely controlled and navigated, which is enabled by a complex interplay of 
sensory organs acting as sensors on the one hand and flight, leg, and trunk muscles 
acting as effectors on the other hand. And this must all take place within a timeframe 
of just a few milliseconds. daa 


7 Curves and surfaces 


This chapter can be seen as a continuation of the chapter concerning vectors. 
It will introduce so-called matrices (especially rotation matrices), which can 
be used to provide elegant descriptions of rotations in the plane, as well 
as in three-dimensional space. Naturally, we will often speak of congruence 
motions, which are the subject of kinematics. 

The fundamental theorem of planar kinematics states that two positions of 
one object can always be transformed into each other by means of a single 
rotation (or translation). The spatial analogon (the fundamental theorem 
of spatial kinematics) states that two congruent positions of an object can 
always be transformed into each other by means of a single helical motion 
(eventually a rotation or a translation). In the limiting case, we even speak 
of instantaneous rotations or instantaneous helical motions. 

This chapter contains important examples of mathematically describable cur- 
ves (such as conic sections) and surfaces. We will focus, in particular, on their 
generation via motions and on the various applications that can be found in 
nature and technology. 

Rigid body motions often produce envelopes of curves and surfaces, which 
we will also study with differential calculus. Envelopes (curves or surfaces) 
play a fundamental role in the study of motions. 


© Springer International Publishing AG 2017 
G. Glaeser, Math Tools, https://doi.org/10.1007/978-3-3 19-66960-1_7 
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7.1 Rigid body motions 


In this section, we will focus more closely on translations, rotations, and he- 
lical motions. These are the “shape preserving” transformations (isometries) 
in the plane and in space which do not change the relative position of points 
on the moved object. 

Vector calculus will be augmented by calculations involving rotational ma- 
trices. 


Translation 


Let P(pz/py/pz) be a point in space with the position vector j. The parallel 
displacement (translation) of the point P towards P; (with position vector 
p) along the translation vector ¢ is accomplished by a simple vector addition: 


. Dy + tx 
7,=p+t or, more detailed, p, =| py +ty |. (7.1) 
Dz + tz 


In R?, the third component is simply zero. 


Rotation in E? about the coordinate origin 


If the point P(p,/p,) is rotated about the origin through the angle y, then 
the new point P; has the position vector (according to Fig. 7.1) 


3 cos — sin 
Te ( ds ia (7.2) 
sin Y pz + COSY Py 


From now on, we will use the short notation 


~ COS P Px — SIN Y Py _ (cosy —siny ek o 
1 oe) ce o Dp (y) -B. (7.3) 


The number scheme 


sin y COs 


R(y) = bee a (7.4) 


is called a rotation matrix. In the next section, we will discuss matrices in 
much greater detail. 

>>» Application: refracted light (Fig. 7.2) 

According to Snell’s law of refraction (the equation is derived in Application 
p. 378), the following holds true for the angle of incidence a and the angle 6 
of the refracted ray: 


sina 


sin 3 


. sina 
n= const > 2 = arcsin (7.5) 
n 
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2 
5 
(os 
3. C 
a C) 
B 
Fig. 7.1 rotation about the origin Fig. 7.2 light refraction 


where n is the material-dependent refractive constant (n = 4/3 for the tran- 
sition from air to water). The ray that hits the surface of the water has the 
direction [ = (-1,-2). What is the direction of the refracted light ray 7 in 
the water? 

Solution: 

From a = arctan1/2 = 26.565°, we infer G = 19.597°. The vector has to 
be rotated through the angle y = a - 8 = 6.968°. Since siny = 0.1213 and 
cos y = 0.9926, we have 


__ (0.9926 - (-1) - 0.1213-(-2)\ _ [-0.750 
"* \0.1213- (-1) + 0.9926 - (-2) } ~ \-2.107]" 


Rotation in E? about an arbitrary point 


Let C(c,/cy) be an arbitrary center of rotation (position vector é), and let y 
be the angle of rotation. Then, we need three steps in order to describe the 
rotation of P about C: 
1. translation of P by -é 

+ auxiliary point H with the position vector h= p-e, 


2. rotation of H about the origin 
+ auxiliary point Hj, with the position vector hy = R(y)-h, and 


3. translation of Hy by é 
+ final result P, with the position vector p, = hi + ¢. 


All in all, we could also have written: 


Pi=R(y)-@-@) +2. (7.6) 


Main theorem of planar kinematics 


For planar kinematics, the following theorem, already mentioned in Applica- 
tion p. 195, is of great importance: 
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Any two positions of a rigid planar figure can always be transformed by rotation 
(or a translation) into each other. 


In the course of a “rigid body motion”, planar figures do not change their 
shape, and thus, the theorem is valid at every instant of the motion, regardless 
of the complexity of the figure. The instantaneous center of rotation is called 
the instantaneous pole. In a rotation, the orbits are always circles around 
the center of rotation, and therefore, the following theorem on the orbit’s 
normals holds true: 


At any instant, the path normals of a rigid body motion pass through a certain 
fixed point, namely the instantaneous pole. 


In an instantaneous translation, the paths of all points are straight lines, and 
so, the path normals are parallel. Examples can be found in the following 
application and Application p. 318. 


>>» Application: trochoidal motion (planetary motion) 

This motion is the superposition of two arbitrary rotations in the plane: A 
point R rotates about a fixed point F’ with constant angular velocity, and 
another point C rotates about the point R at the same time with the pro- 
portional angular velocity. Then, C' undergoes a so-called planetary motion. 
Show that the same motion can also be produced as the trace of a point on 
a circle rolling on another circle. Determine the radii of the circles. 


Fig. 7.3 trochoids 


Solution: 

First of all, let us consider the points F’ and R as fixed and as the centers of 
two touching rubber-coated wheels p and q (Fig. 7.3). If one turns p, then q 
automatically rotates (for example, w times as fast). A point C' that is firmly 
attached to q also rotates about R. 

Now you hold p steady and the axial connection FR of fixed-length s is 
turned instead. Then, q rolls on p, R rotates about F’, and the fixed ratio of 
the radii ensures the proportionality of the rotations. The point C’ describes 
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the desired trochoid. 


Fig. 7.4 cycloid (left), trochoid (right) 


The rubber coating on the wheels guarantees a rolling without gliding, and 
thus, equal arcs r are transmitted. Let rp and rg be the circles’ radii (rp+1rq = 
s). Then we always get rp, = wr,. From the two conditions, we deduce 

s sw 


r(w+lj=s => rR a 


w+ 


The ratio of the radii is, thus, indirectly proportional to the ratio of the 
angular velocities (rg: Tp = 1: w). Admitting also negative radii results in 
so-called hypocycloids, i.e. curves that are traced by “rolling a circle” inside 
the fixed circle (Fig. 7.4). In any case, the instantaneous velocity is zero 
at the point of contact of the circles, and therefore, it is the instantaneous 
pole, through which all path normals n are running (cf. the theorem on path 
normals p. 300). The path tangent t of the point C is, thus, perpendicular 
to its connection with P (see Fig. 5.28). 


® Remark: Actually, the name planetary motion comes from astronomy: The planets 
move around the Sun. In addition to the motion we observe, there is also the Earth’s 
own rotation. Thus, planet orbits appear more complex in the firmament than 
ordinary stellar orbits. ® 

Among the trochoidal curves, we find ellipses for the special ratio 1: -1 
(Application p. 316). Other special circumstances will yield a multitude of 
curves that play a role in some area of mathematics, such as the Cartesian 
quadrifolium in Application p. 43. <<a 


In space, we do not rotate about points. We rotate about straight lines. The 
simplest case is the rotation about the z-axis or about parallel axes. 


Rotation about vertical axes 


Such rotations can be traced back to the two-dimensional case. If a point 
P(pz, Py, Pz) rotates about the z-axis through the angle y, then the new 
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point P;, according to Formula (7.2), has the position vector 


COS Y Px — SIN Y Py cosy: p, — sing-py + O-p, 
Pi=|singp,+cosyp,|=|sing-p, + cosy-py + O-pe]. (7.7) 
Pz O-pr + O-py + Ip, 


i | 


Fig. 7.5 rotations about vertical axes 


Again, we use an abbreviation in the form of a rotation matrix 


cosp -siny 0 
Pi=|sinp cosy O0|-p=R,(y)-p. (7.8) 
0 0 1 


The rotation matrix R,(y) arises from the matrix R(y) in Formula (7.4) by 
adding a column and a row of zeros with a 1 on the crossing point, i.e. in the 
matrix’s diagonal: 


cosp -siny 0 
R,(y)=|]sing cosy Of. (7.9) 
0 0 1 
The rotation about a common parallel z-axis is accomplished by translating 


the z-axis, applying the above rotation (matrix), and applying the inverse 
translation. 


Rotations about arbitrary axes 


Swapping the coordinates yields the formulas for the rotation about the other 
two coordinate axes. 
For example, the rotation matrix for the rotation about the z-axis reads 


1 O 0 
Rx(y)=]0 cosy -sing]. (7.10) 
0 sing cos 


A rotation about an arbitrary line g through the origin by the angle yp can 
always be interpreted as a combination of special rotations. As Euler has 
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Fig. 7.7 universal joint in theory and practice 


shown, all rotations can be decomposed into a rotation about the z-axis, a 
subsequent rotation about the z-axis, and finally a rotation about the new 
z-axis. The corresponding rotation angles are called the Fuler angles. 


>>» Application: universal joint (Fig. 5.67) 

We recall the universal joint for the transmission of the rotation about an 
axis a — to a non-proportional rotation about an axis b intersecting a. The 
point A of the first fork is rotated through an angle a about the z-axis. 
The point B of the second fork rotates initially about the z-axis by an angle 
B = arctan “22 (see Section 2, Formula (5.48)) and finally about the z-axis 
rotated through the angle y. 

Let r be the radius of the forks. Then, the position vectors @ and 6 of the 


points A and B can be given by means of the matrix notation as 


re 


0 
A=Ra(a)-[0], b= Ra(a)-[ Ra(8)-|0] ] 


ddd 


>>> Application: oblique views of an object 

Explain the “photography machine” in Fig. 7.8 or Fig. 7.9. 

Solution: 

In order to view or photograph an object from all possible directions, one 
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Fig. 7.8 change of elevation angle and azimuth angle 


Fig. 7.9 change of target point and distance 


can proceed economically as follows: The camera can be rotated about a 
fixed horizontal axis, and the object can be independently rotated about a 
vertical axis. In this manner, you can set any line of view and thus create 
any orthogonal projection. 

Unfortunately, photographs are central projections. The target point (the 
point in the center of the image) and the distance (from the lens center to 
the target point) are of importance for the image. 4 


>>> Application: rotation of the firmament in the course of a day 

In the course of one day, the Earth will remain roughly in the same position 
on its orbit around the Sun (actually it rotates about 1° about the axis 
of the orbital plane). Relatively speaking, the Sun rotates (and of course, 
the positions of the “fixed stars” in the firmament) about the Earth’s axis 
(“around the North Star”). The projection of this axis into the horizontal 
base plane a points towards North and the axis itself encloses the angle y 
with 7, where ¢ equals the latitude of the current position (the lead encloses 
the complimentary angle 90° - y with the Earth’s axis). 


® Remark: Currently, the Earth’s axis points towards a star which we call the Pole 
Star (North Star). Over the course of approximately 25,700 years, the axis of the 
Earth rotates about a “gyro axis” perpendicular to the orbital plane (see for example: 
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http: //astro.wsu.edu/worthey/astro/html/lec-precession.html). So, when 
the ancient Egyptians aligned their pyramids precisely with the north around 4, 500 
BC), our present North Star was certainly not seen as a star whose position in the 
night sky remains fixed. @ <<< 


>>> Application: Why does the sky rotate in the reverse direction on the 
southern hemisphere? 

This question is not trivial at all: The answer “Because on the southern 

hemisphere we stand kind of upside-down” is wrong! 


Solution: 

The correct answer is the following: On the northern hemisphere, we have to 
look towards the North Star to perceive the Earth’s rotation. On the southern 
hemisphere, we must look in the opposite direction of the Earth’s axis (i.e. 
towards the southern celestial pole). A rotation in space cannot be oriented 
if the viewing direction is not given. 

® Remark: If we just make a headstand, this does not change the orientation. It will 
only result in a twist of one and the same image by about 180°. 

The Sun’s rotation seems to have the opposite orientation than that of the other 
stars. This is so, because if we watch the sun, we turn off the corresponding celes- 
tial pole. On the northern hemisphere, we look southwards. Thus, the orientation 
changes, and the Sun seems to rotate clockwise. ® da 


Rotations about arbitrary axes 


Rotations with axes that do not run through the origin can be accomplished 
analogously to the two-dimensional case as follows: If C is an arbitrary point 
on the axis of rotation, we first move the axis of rotation and the point to 
be rotated about —é. Then, we rotate about the displaced axis, and translate 
everything back (translation vector @). 


>>» Application: rotoidal motion (Fig. 7.10) 

A rotoidal motion is the composition of two proportional rotations about 
perpendicular skew axes. Such motions occur as relative motions at worm 
gears that transmit a rotation from one axis to another skew perpendicular 
axis. 

A point is rotated about the z-axis (through an angle a) and it simultaneously 
rotates about the axis 6 which is also undergoing the first rotation. The 
rotation about is proportional to that about a, and thus, the second rotation 
angle is 3 = na (Fig. 7.10a). The same result is achieved if we first rotate 
about the axis (0/-1r/0) parallel to the x-axis through the angle @ and, after 
that, about the z-axis through the angle a. A point P (position vector #) is 
transformed to the position 


0 0 


Pi = Rz(a)-| Rx(B)-(B-] -r }) +] -r 
0 0 
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Fig. 7.10 rotoidal motion 


Fig. 7.10b shows which surface arises when, for example, all points lying 
on a straight line that intersects b perpendicularly and meets the line a are 
transformed. <<< 


>>> Application: Everything revolves around the parabolic flight. 


Fig. 7.11 Everything revolves around the parabolic flight. 


In Fig. 7.11, different, and sometimes complicated, trajectories are depic- 
ted. Nevertheless, they have one thing in common: The paths of additional 
non-moving body parts — for example, in an ideal somersault — will be pla- 
nar curves generated by rotations about horizontal axes through the center 
of gravity moving along the bomb trajectory, provided that the rotation is 
uniform. 4<< 


Helical motion 


If a point P(pz/py/pz) is rotated about an axis through the angle y and 
simultaneously translated along this axis proportionally to the translation, 
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then one speaks of a helical motion. The ratio c between the translation 
distance and the angle y of rotation is the parameter of the helical motion. 
If the helical axis coincides with the z-axis, then the new point P, has the 
position vector 


0 COS Y Px — SIN Py 
Pi =R,(y)-P+ y] 0] =] singps + siny py |. (cy) 
Cc Pz t+ CY 


For fixed coordinates of P, the equation (7.11) already describes the trajec- 
tory of P in the helical motion around the z-axis by parameter c. One speaks 
of the parametric representation of the helix. In the following section, we will 
develop a number of these parametric equations. 


Fig. 7.12 twining plant 


>>> Application: twining (Fig. 7.12) 

Creeper, climber, and twiner are names for plants that try to spread across 
surfaces like twigs. They combine a translation and a rotation in order 
to find new carrier objects and thus perform helical motions. Sometimes 
they obviously switch the orientation of the rotation (in the left part of the 
image). <<< 


>>> Application: simple and effective damping (Fig. 7.15) 

If you want to fasten your boat in the harbour, it is a good idea to have 
something between the rope and the fixed attachment that protects the rope. 
The depicted helical spring does the job efficiently. When the boat is jerkily 
teared off by waves, the two hooks compress the spring and, mathematically 
speaking, diminish the parameter of the helix. This allows for more distance 
between the boat and the harbor. The spring forces then “harmonically” tear 
the boat back. <<< 


>>> Application: highway exit (Fig. 7.16) 
A 6m wide constantly inclined uphill road (appearing circular from the top 
view) needs to overcome a height difference of 12m. A 270° rotation will take 
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Fig. 7.15 damped spring as a mount of a boat in the harbour 


place, and along the center line, the slope equals 10%. How big is the inner 
radius r,, and the slope along the inner border line? 


Fig. 7.16 motorway or equivalent ... 


The top view of the center line is 120m long, because of the 10% slope. 
A circular arc with radius r,, and central angle of 270° has arc length 
270 7g97m = 4.712rm, and thus, rm = 120m/4.712 » 25.5 m. The inner si- 
de of the road has radius r = r,,-3m = 22.5m. The arc length of its top view 
is  106.03m. Thus, the slope is 12 m/106.03 m » 11.32%. 


We find the helical parameter 
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In the coordinate system shown in the figure, the helices with radius rp have 
the parametric representation 


0 Tm 3 
&(y) = R.(¢)-|-ro]+| rm (0 ys =). 
0 cy 
This equation was used to generate Fig. 7.16. Furthermore, surfaces of con- 
stant slope are shown along the constantly inclined part of the street. These 
surfaces are helical developables. <<< 


——— ye 


Fig. 7.17 “elliptical” helical staircase (top/bottom view) and “ordinary” staircase 


>>> Application: helical staircase (Fig. 7.17) 

Helical staircases are usually formed by applying a helical motion to a “pro- 
totype” of a step. The smoothly polished bottom is called a “helicoid”. The 
banister is a helix. In Fig. 7.17, the stairs have been subjected to a “compres- 
sion” so that the top views of the helices are ellipses and not circles. Write 
the equation of the upper curve assuming that the banister’s height is 1m. 
The base is an ellipse with a semi-major axis of 2m and a semi-minor axis 
of 1m. One full turn of the helix covers a height difference of 4m. 

Solution: 

Initially, we do not worry about the compression. The curve is a helix with 


radius ro = 2 and parameter c = Oa Let the cylinder’s axis be the z-axis 
T 


and the first point of the banister have the coordinates P(1rp/0/1). Then, the 
equation of the helix is 
rg cosy 
Pi =| ro snp 
l+cy 
The compression is achieved by multiplying the y-coordinate by 1/2. <<< 


The main theorem of spatial kinematics 


The following theorem is of fundamental importance for the theory of spatial 
motions: 
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Any two positions of a rigid body can always be transformed into each other 
by a helical motion. 


Fig. 7.18 the main theorem of spatial kinematics 


Proof: The two congruent bodies can be defined by the two associated (and thus, 
also congruent) triangles A; = A,B,C; and Ay = Ap B2C 2 (Fig. 7.18). Now, we 
assume that there is a helical displacement that transforms A, into Ay and show 
that the axis and the parameter are uniquely determined. 

A helical motion is a composition of a rotation and a proportional translation along 
the rotation axis a. Therefore, the vectors 


os 


—— — 
U= A,Ao, v= B, Bo, and w= C1C 


can be decomposed into the translational component f parallel to a and the rotatio- 
nal component normal to a. For all three vectors, t is the same, so that af, o-f, 
and w—f are normal to a. 

We interpret tw, G, and w as position vectors of the three points U, V, and W that 
form a plane ¢ 1 a at the distance s = |#] from the origin, with s being the length 
of the translation distance. The normal vector @ of ¢ is the direction vector of the 
helical axis, parallel to ¢. We obtain it via 


a= UV x UW = (8-t) x (wi). 


We normalize the vector 


and obtain the equation of € as 


The normal distance s from the coordinate origin to ¢ is equal to the constant s =c 
taken from the plane’s equation in Formula (5.38). The translation vector is now 
t = sitio. Now, we apply the translation — to the triangle A» and get the new triangle 
AS which is joined with A; by a rotation about an axis. This axis equals the desired 
helical axis and can be found as the intersection of the bisector planes of A; AS and 
B,B3. 
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® Remark: In any smooth spatial motion, two “neighboring positions” are congruent. 
The corresponding helical motion is then called the instantaneous helical motion, 
and the axis is called the instantaneous helical axis. ® ©) 

>>> Application: determination of the helical motion 

Find the helical motion which moves the triangle A;(—3/0/0), Bi(0/0/0), 
C(0/2/0) into the congruent triangle A2(1/0/2), Bo(1/-3/2), C2(3/-3/2). 
Solution: 

With the notations of the above proof, we have 


4 1 3 
u=A,Aq=|0], v= B, Bog =|-3], w=CiCo =] -5 
2 2 2 
and 
-3 -1 0 
G=(0-&) x (W-U%) =] -3] x] -5] =] 0 
0 0 12 
The vector is easy to normalize: 
0 
ay =10 
1 
The helical axis is parallel to the z-axis. The plane ¢ has the equation 
0 0 4 
O| -=]0]-] 0] =2, 
1 1 2 
so that with s = 2, we get: 
0 
t=s-Go=|0 
2 


We subtract ¢ from A», Bo, C2 and obtain 
2(1/0/0), B3(1/- 3/0), C2(3/ - 3/0). 


According to Formula (5.27), the planes of symmetry of the segments A;A} 
and B, Bs are 


4 i iy (3) 
ee 10 @==)0) <1 0 | l==%, 
0 21\o 0 
1 Ae 2 10\" 
02: 3)-2=- 3 0 = 5. 
0 0 0 


The intersection a of these two vertical planes is the helical axis. It is parallel 
to the z-axis and passes through the point (-1/ - 2/0). <<< 
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7.2 Matrix calculations and some applications 


In the following, we will calculate with matrices. We need them for some app- 
lications, such as coordinate transformations, but also for tasks from business 
economics. 

An (n,m)-matriz A is nothing more than a rectangular array of numbers 
with n rows and m columns. The transposed matrix A’ is an (m,n)-matrix 
produced by interchanging rows and columns. 

Vectors are also matrices in this sense. An ordinary three-dimensional vector 
can be considered as a (3, 1)-matrix, or, transposed to a row vector, as a (1,3) 
matrix. Our rotation matrices from above are (3,3)-matrices if we calculate 
in three-dimensional space or (2,2)-matrices if we calculate in the plane. 
Each row or each column of a matrix is called a row vector or a column vector. 
Matrices can be used — as we have covertly seen with the rotation matrices 
— to write confusing situations (with many indices) in a very clear way. In 
business economics, they are used to manage large amounts of interdependent 
data. The great advantage of writing in this way is that now you can perform 
tasks by means of so-called matrix multiplication that would otherwise almost 
reach the breaking point of being barely readable. 

The definition of the multiplication of two matrices is as follows: 

Let A be an (m,n)-matrix, and B an (n,k)-matrix. A must have as many 
columns as B has rows. We define the result A-B of the multiplication as an 
(m,k)-matrix C. Its elements cj; are the respective dot product of the i-th 
row vector of A with the j-th column vector of B. 


Described in detail, the definition looks like this: 


aj a2 Qin bi bie Dik 
a a a b b b 
A= 21 22 2n ; B= ah 22 2k 
Am1 Am2 Amn Dnt bn2 Dk 
C11 
=>C=A.Be=|-: Cij oo | with Cig = ajb4; + ajgbo5 tae Qindn;- (7.02) 
Cnk 


>>> Application: multiplication of vector and matrix 


Multiply the vector @ = & once from “the left” and once from “the right” 
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i. 


by the matrix A = 3 


-2 ee 
_ 7 by considering it once as a row vector and then 


as a column vector. 
Solution: 


ey ae -»)-(; )= 1+ (-9):3 4-(-2) + (-5)-( (4). 


aoe 3)-(4)-G23:3)- (1). 


@® Remark: The example shows that the order of multiplication matters. A row 


vector will yield another row vector, and a column vector will result in another 
column vector. ® <<< 


Change of coordinate systems 
We can consider a coordinate system as a spatial tripod. It consists of mutually 


orthogonal unit vectors €, f, g, and the position vector t towards the 
origin 7’. For the “basis vectors”, we have 


eS) 29 2, Sef ae 2 ee St (7.13) 


We denote the two matrices that can be defined by the triple €, f, Gg by 
K and K’, where 


Ex €y €z zi er fx Gz 
K=|fe fy fel, K =ley fy gyl- (7.14) 
Gr Gy Gz ez ie Jz 


Via Formula (7.13), it can be seen that the product of K and K? yields the 
identity matric E. The entries of the product matrix are defined as the dot 
products of the corresponding row and column vectors. One may even swap 
matrices (this is a little harder to show), so that finally, the following holds: 


100 
K-K’ =K’.K=E=|0 1 OJ]. (7.15) 
001 


® Remark: Matrices with this property are called orthogonal matrices. ® 


Now, we move a spatial tripod from one position to another, taking a point P 
which is firmly connected to the system (Fig. 7.19) with it. Based on the cur- 
rent position of the tripod, the point has constant coordinates or a constant 
position vector p. What position vector does the point P* have relative to 
the original coordinate system? Conversely, a point in the original system 
has the position vector 2p, what is its position vector in the new system? 
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Fig. 7.19 motion of the coordinate systems 


Linear algebra provides the clear formulas 


=> 


p'=t"+K".p, DP=K-(p*-t*). (7.16) 


Proof: Initially, we have: € +P = py, ten 7 Py, G°D = pz, because if € is the 
angle between € and 7, then 


> => > => 
€+p €+-p a ae 

cos € = ——> —— > €- p =|p|-cose = pz, ete. 
é|-lp] |p| 


Furthermore, the following holds: 
— * * 
p=p -t. 


Therefore, according to Formula (7.12), we get, 


e-p 
Cx Cy Ez Pa ° Px 
* —>* => 
K-(p -t )= fx fy fz |- Py |= f-D =| Py = 7. 
Ge Gy Gz) \dz Gp Dz 


This proves the right-hand side of the formula. Now, we multiply the right equation 
on the left by K” and find 


KT .K-(B"-¢ )=K™-P. 


Further, we have K’ -K = E, cf. Formula (7.15). “Nothing happens” if we multiply 
a vector by E. After addition of ¢* to both sides, we face the left-hand side of the 
equation in Formula (7.16). © 
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7.3  Parametrization of curves 


Parametric representation of a circle in R? 

A circle “in principal position” (center 1/(0/0), radius r) can immediately 
be parametrized. It arises by rotating the point (r/0) about the origin (the 
rotation angle wu is given in radians for the purpose of calculating with a 
computer): 


._ _(rcosu 
w= (7 (O<u<2n). (7.17) 


Now, the midpoint (center of rotation) shall be the point M(mz/my,). Then, 


we have 
r COSU+ My 


rsinu+ My 


t=t9+m=( (0 <u <2z). (7.18) 


Intersection of a circle with a straight line 


Now, let i-% = c be the implicit equation of a line g. The intersection with 
an arbitrary circle then satisfies the condition 


. [rcosu+mMez 
Te i =C 
rsinu+ My 


This leads to an equation of the type Pcosu+ Qsinu+ R= 0: 


NpT COSUF NyT SINUtNgMyz +NyMy — c= 0, 
—=——_ eo ——_., —_—_<“_— 
P Q R 


thus, a quadratic equation in u. Real solutions are present if gM <r. 


>>> Application: satellite in the Earth’s shadow, lunar eclipse (Fig. 7.20) 
A satellite orbits the Earth at an altitude of 500 km (by the way, this enforces 
an orbital period of about 1.5 hours). How long is the satellite in the Earth’s 
shadow? 

Solution: 

Essentially, the problem is two-dimensional. That is, we can consider the 
circular satellite orbit in the xy-plane: 


z= (’ can with r=R+500km, R=6,370km. 
rsinwu 


The streak of light to the Earth’s contour is the line of intersection 


(C0) 


316 Chapter 7: Curves and surfaces 


Fig. 7.20 satellite Fig. 7.21 the Moon in the Earth’s shadow (35 h) 


Now, we intersect the straight line and the circle: 


(") . ¢ 7) Spd 120. ores yen 
1 r sin u R+500 


The second solution u = 180° — 68° is not relevant. The satellite is in the 
Earth’s shadow for —-68° < u < 68°, thus, for ae 1.5h % 34 min. 

In practice, this time is even shorter: The streak of light is refracted in the 
Earth’s atmosphere towards the lead direction, where the lower-energy red 
portion of the light is further refracted. This phenomenon occurs during the 
lunar eclipse, where, instead of the satellite, the Moon disappears in the 
Earth’s shadow. The refracted red light always hits the Moon because of its 
great distance. That is why the Moon becomes “blood red”, but not totally 
dark (Fig. 7.21). <<< 


>>> Application: special ellipse motion (Fig. 7.22) 

Let us assume that we are given two perpendicularly intersecting lines on 
which the two points A and B move. The distance s = AB is constant. Then, 
each point C on the line AB traces an ellipse. Prove this. How can the traced 
curve be determined pointwise? 

Proof: If one chooses A on the first line a, then one obtains B on the second line 
b as the intersection with a circle around A of radius s. 

Let M = anb. Let us consider the rectangle M APB and its circumcircle q (midpoint 
R). The diagonals MP and AB enclose the same angle a with the axis a. Let us 
now consider the points K and N on MP and b (or on the line parallel to a through 
C). The distance MK equals AC, and therefore, it is constant. Thus, k moves along 
a circle around M. Furthermore, NK is always proportional to NC, because it is 
constant since t = RC = constant, and thus, 


NC _ (s/2+t)cosa _ s/2+t 
NK (s/2-t)cosa s/2-t 


So, the locus of C' is obtained by stretching K’s circular orbit and is, hence, an 


ellipse. ©) 


Since MP = AB = s, the circle q has the constant radius s/2. For each position 
of AB, the point P lies on the circle p centered at M with the double radius 
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Fig. 7.22 specific and general ... Fig. 7.23 ... ellipse motion 


r. Since the point P is the intersection of the path normals of A and B, it 
is the instantaneous pole of the motion (theorem on path normals p. 300). 
The circle g can be viewed as the isoptic circle of the segment AP (see more 
Fig. 4.52). According to the inscribed angle theorem, 2 ARP = 22 AMP. 
Therefore, the circular arc AgP (measured on p) is of the same length as the 
arc AP (measured on q) for any position of P. So, we can say: The circle 
q rolls without gliding on the circle p that is twice the size. Therefore, an 
ellipse motion can also be generated by rolling circles (ratio of radii 2: 1). 
These are special types of hypotrochoids (Application p. 300). <<< 


Intersecting two circles in the plane 

The intersection of two circles can be traced back to the intersection of a 
circle with a straight line (radical line): The straight line joining the points 
of intersection is orthogonal to the connection of the two centers. 


Its distance d; from the center of the first 
circle results from the Law of Cosines (with 
the notations from the adjacent figure): 


r2 = d? +1? - 2dr, cosa 


e__’ 
dy 
are Fe 
== — + — 2 
Fig. 7.24 intersection of two circles 2d 


>>> Application: general ellipse motion (Fig. 7.23) 

Let there be two intersecting lines wu and v which form an arbitrary angle. 
On these lines, two points U and V move. The distance s = UV must be 
kept constant. Then, each point C rigidly connected to the line UV traces 
an ellipse (with UC = d and VC = e). How do we find the point C if we are 
given the position of U? 
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Solution: 

The circle about U with radius s meets v in two possible positions of V. The 
circles about U of radius d and the circle about V with radius e deliver the 
point C (two solutions). 

The intersection of M = unv is the midpoint of the orbit. The circle g through 
U,V, and M (center R) is called the pitch circle. It can be interpreted as the 
isoptic circle of the segment UV, because the angle 2U MV is constant (see 
Fig. 4.52). Thus, q is independent of the choice of U. According to Thales’s 
theorem, the point P opposite to M on the pitch circle is the intersection of 
the path normals of U and V. Thus, it is the instantaneous pole (theorem of 
path normals on p. 300), and the radius of the circle p about M through P 
is also constant. The ratio of the radii of p and q equals 2: 1, and q always 
touches p from the inside. 

Now consider the path of the point A on the pitch circle opposite to C. 
According to the theorem of inscribed angles, the central angle 2 PRA is 
twice the contact angle 2 PMA. Thus, the length of the arc PAg on p is 
equal to the length of the arc PA on gq. Since this is true for any position 
of C’, there exists a real rolling of g in p. Therefore, the two arcs are always 
of the same length, and there is a real rolling of circles in the ellipse motion 
previously described. Here, A travels obviously on the line Ap. The same 
applies to the point B opposite to A on the pitch circle. The two orbital lines 
of A and B are perpendicular. The segment AB is of constant length. Thus, 
one can also achieve the same motion when A moves on the fixed line MA 
and B on the fixed perpendicular line MB. For further information on the 
proof, see the previous example. <<< 


>>> Application: four bar mechanisms 
The following mechanisms are used very often in technical applications: First 
a rod LA rotates about its fixed bearing point L, and a second rod MB 
rotates about its fixed bearing point M. The motion of the rods is coupled 
by a fixed orbit length AB. Each additional point C' rigidly attached to the 
rod AB traces a so-called coupler curve. 


Fig. 7.25 a typical coupler curve (two snapshots) 


How is the position of C calculated provided that the first rod LA is given? 
Determine the path tangent and instantaneous velocity of C. 
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Solution: 

Everything leads to the intersection of two circles. First, B is determined 
through the intersection of the circle around A with radius AB with the 
circle centered at M with radius MB (two solutions). Then, C lies in the 
intersection of the circle about A with radius AC with the circle about B 
with radius BC. This time, because of the sense of revolution of ABC, only 
one point of intersection is permitted. 

The path tangent of C can easily be determined: We know the orbital circles 
of A and B. The associated path normals are the lines of the rods LA and 
MB. They intersect each other according to the theorem on path normals 
(p. 300) at the instantaneous pole P. The path tangent at C is, then, per- 
pendicular to the polar radius PC. The distance PC to the instantaneous 
pole (center of rotation) is also proportional to the instantaneous velocity 


of C. ddd 


>>> Application: crane clearing the cargo ship 
In what direction does the cable hook of the crane located on the pier (Fig. 
7.26) move? 


Pp 


Fig. 7.26 ship’s crane Fig. 7.27 circle in space 


Solution: 

One recognizes the quadrangle LABM with the fixed points ZL and M about 
which the points A and B rotate. This results in the instantaneous pole P 
and the path tangent at the point C' which are — as expected — horizontal. The 
hook H is always right below C' and also has a horizontal tangent provided 
that the pitch is constant. <<< 


Parameter representation of a circle in R® 


In three-dimensional space, the situation is generally much more complicated. 
First, we consider the special case in which the center equals the origin and 
the circles’ axis is the z-axis: 
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r cosu 
Zo=|rsinu] (O<wu< 2z). (7.19) 
0 


Now, we want to create a circle by rotation of a point about an arbitrary 
axis. 


We let 
Ay 
4 = : ee ey 
G@=|ay| with \/a,+a,+az= 
az 


be the normalized direction vector of the rotation axis. The axis may contain 
the origin. 

The following considerations shall be supported by a look at Fig. 7.27: We 
find the direction of the axis if we rotate the z-axis through the angle -¢ 
about the z-axis and then rotate through the angle —(@ about the z-axis. 
The following relations are valid: 


: az x a 
cosé=az, siné=,/a2+a2=s, or cosB = —, sin 8 = —. (7.20) 


The rotation of the points on the circle (7.19) about the z-axis (rotation 
angle -e) yields 
r cosu 
% =R,x(-¢€)-% =| razsinu 
rssinu 


The subsequent rotation about the z-axis finally yields 
“(Gz COSU + Gy az sin u) 
& = R,(-8)-%1 =| (ay cosu— aya, sinu) 
rssinu 


Now, we can even prescribe an arbitrary center M(m,/my/m,) and find 
the general parametric representation of a circle in three-dimensional space, 
which, though not simple, is often quite useful: 


Mz + (ag cosu + aya, sin u) 


E=|my+T(aycosu-aza,sinu) | (s=/a2+a2, 0<u<2z). (7.21) 


m,+rssinu 


>>> Application: boundary of the shadow on the globe 

The day-night boundary on the Earth is a great circle in a plane perpendicu- 
lar to the direction of the light ray. On which points does the Sun rise above 
or below the latitude wy? 
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Fig. 7.28 and yet it moves... 
Solution: 


The (normalized) vector of the light ray is the direction vector a of the axis 
of the Earth’s circular terminator. The radius R of the terminator equals 
that of the Earth. 

We obtain the desired points on the given latitude by the intersection of the 
shadow’s boundary with the plane z = Rsiny. Consequently, we have 


: . . siny 
Rs sinu = Rsiny => u = arcsin —. 
$ 


For sin y > s, there is no real solution: On the corresponding circle of latitude, 
there is either a 24h day or a 24h night. <<< 


Fig. 7.29 The sphere is not developable. 


>>> Application: The sphere is not developable. 

At this point, we shall have a brief excursion to cartography. As is known, it is 
not possible to flatten the surface of a sphere into a plane without distortion. 
Fig. 7.29 illustrates that the shell of a reasonably spherical mandarin still 
remains curved although it is “torn” in several places (mandarins are easy to 
peel, because they have a poorly developed inner pericarp layer). 

There are now countless methods that somehow allow us to transform the 
spherical surface into the plane. None of them can preserve lengths, areas, 
and angles simultaneously. The simplest method is to identify the longitude 
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Fig. 7.30 the shadow boundary in a rectangular map 


and latitude of a point with Cartesian coordinates. In this coordinate frame, 
the sphere is mapped to a rectangle. Fig. 7.30 shows what the shadow boun- 
dary from Fig. 7.28 looks like in a rectangular map. The intensity of this 
deformation of the spherical surface can best be seen close to the North and 
South Pole. The shadow curve is obviously not a sine curve, as one might 
assume (also see Fig. 3.38). The Earth’s rotation corresponds to a left shift 
in the coordinate grid. << 


>>> Application: shortest flight route (Fig. 7.91) 

The shortest route from P to Q is a great circle, i.e. a circle with the center 
of the Earth M as its center. The circles’ axis is the normal of the plane 
PQM< and the circle has the radius R = 6,370km plus the altitude (aircraft 
10km, satellite between 150km and 36,000km). <<< 


Intersection of a circle with a plane 

Let 7-% = c be the equation of a plane ¢, and let k be an arbitrary circle 
with radius r. We insert % from the equation (7.21) into the equation of the 
plane, and thus, we obtain one more equation of the type 


Psinu+Qcosu+ R=0 


(see p. 143). It has two real solutions if the intersection line of ¢ with the 
circle’s plane has a distance < r from the axis of the circle. 


Intersection of two circles in space 

In general, two circles c; and c2 (centers M;, radii r;) in space have no point 
of intersection. If they lie on one sphere (center K, radius @), then they may 
have common points. In such a case, the axes must intersect at the point K 
(center of the sphere) on the one hand, and on the other hand, the following 
has to hold for both circles: 


—!) 
K M; +r? = 0”. 


>>» Application: Monge’s “sphere technique” (Fig. 7.31) 
In order to determine the intersection curve of two surfaces of revolution with 
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intersecting axes, we choose an auxiliary sphere around the intersection of the 
axes with variable radius. Any such sphere meets the surfaces of revolution 
along one or more circles. Any two circles on either surface have potential 
intersections belonging to the intersection curve. The radii of the auxiliary 
spheres may not be too small or too large. Otherwise, the desired intersection 
circles may not be real. <<< 


Fig. 7.31 Monge’s technique Fig. 7.32 distance circle on a sphere 


>>» Application: distance circles on the sphere (spherical trigonometry) 
Compute those points on 15° E that are 2,000 km away from London (on 
the prime meridian, y = 51.5° N). Does the Sun rise at these points an hour 
earlier? 

Solution: 

A distance of 2,000 km (measured along a great circle on the sphere) cor- 
responds to a central angle of 18° (radius R = 6,370km, circumference 
40,000km). The radius r of the distance circle is, then, 


r= Rsin72° = 1,968km. 
The distance of the center of the circle from the origin is 


a=VR?2-r?, 


The plane carrying the distance circle has the equation 


0 0 0 
—cosy|-%=|-cosy]-]-—acosyp 
sin y sin y asin yp 


Therefore, 
—cospyt+siny z= a(cos” yp + sin? yp) =a. 


The carrier plane with a longitude of 15° (A = 15°) has the equation 


—cosAx+sinAy = 0. 
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For the intersection of the two planes, we immediately find a parametric 
representation 


g=tanAt, y=t, z=(a+cospt)/sing. 


For us, the latitudes of the two intersections are of interest. One equals 
37.5° N (southern coast of Sicily). The other one equals 68° N (above the 
Arctic Circle on the Norwegian Arctic coast). The two points are about 
3,500km apart from each other. Berlin is almost exactly halfway between 
the two points. On the same latitude as London, the Sun will rise on 15° 
E an hour earlier throughout the year. In Sicily, the lengths of the days 
(shorter in summer, longer in winter) are more balanced during the year than 
in London. The Sun rises more than an hour earlier in winter and in less than 
an hour earlier in summer. The opposite is true for northern Scandinavia. 
The aforementioned point on 68° N even lies above the Arctic Circle, and 
on top of that, it has a polar night in the second half of December! This 
could also be formulated as follows: Both sunrise and sunset take place at 
12h noon. <<< 


>>> Application: distance circles on curved surfaces 

Surprisingly, a two-dimensional being on a surface can find out whether and 
to what extent “its carrier surface” is curved without leaving the surface and 
moving to three-dimensional space: It only needs to draw a circle of radius 
r and measure its circumference (Fig. 7.33). If this circumference equals 
27-1, the being lives on a parabolic surface (specifically, in a plane). If the 
circumference is larger, then it lives on a hyperbolic surface. On an elliptic 
surfaces — like the globe — the circumference is smaller than expected. 


Fig. 7.33 distance circles on a variety of curved surfaces 


For example, if we draw a circle around the North Pole of the globe with a 
radius of r = 10,000 km (which equals one quarter of the circumference of 
the Earth), all points on the circle will lie on the equator. Its circumference 
is 40,000 km. However, we compute 27-r » 62,800 km. Our carrier surface 
is, therefore, strongly elliptically curved. <a 


Different representations of curves 


A curve in a plane can be represented in four ways: 
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1. Explicit representation 
If the curve has no tangents parallel to the y-axis, it can be interpreted 
as a function graph and the y-value can be explicitly given: 


y=f(z), 21 <2< 20. 


In vector notation, a point on the curve has the coordinates (a/f(2)). The 
associated tangent is parallel to the direction vector (1/f’(x)). Typical 
examples are the basic functions discussed in the previous chapter y = 7”, 
y = sina, y = cosa, y = tang, and y = e” (all these functions have no 
restriction on x), as well as their inverse functions (with restrictions on 
certain intervals). 


2. Parameter representation 
If the curve also has tangents parallel to the y-axis, one may use the 
parameter representation 


The tangent that corresponds to the curve point is determined by the 
direction vector (#(u)/y(u)). A classical example is the circle 


Fig. 7.34 a space curve using parametric representation 


In “the computer age”, this description of curves is most widely used, 
because it is the most general. It also works well for curves in space, 
where we just need to get a third equation for z. In order to illustrate 
this, we look at the equation of a rotoid (Fig. 7.34, left: top view, right: 
general view) that winds along a torus (see also Fig. 7.71): 


(a-b cosnu) cosu l 
Z=|(a-bcosnu)sinu] (O<u<62, n=--). 
. 3 
b sin(nu) 
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3. Representation in polar coordinates (Fig. 7.45) 
Sometimes, it is convenient to use so-called polar coordinates. Here, the 
distance r from the origin is a function of the polar angle y: 


r=r(y), pis PSs po. 


The tangent that corresponds to a generic point on the curve and the 
polar ray subtend the angle 


wy =arctan ae ((.22) 


(proof in Application p. 327). Classic examples of the description of curves 
by means of polar coordinates are logarithmic spirals (Fig. 7.37) 


r=roe’?’ (k=constant, y€ R) (72a) 


eon 


Fig. 7.35 An Archimedean spiral (left) and a practical approach (right). Strictly 
speaking, the center line of the cable is called an involute, which arises when one 
“uncoils” a thread on a cylinder of revolution (shown in red). 


and — in comparison — the Archimedean spiral (Fig. 7.35, left) 


r=ky (k=constant, yéR). (7.24) 


For the latter, the radius increases by the constant value 27k “per revo- 
lution”, which is nicely shown on the right in Fig. 7.35. 


Polar coordinates can be easily converted into Cartesian coordinates and 
vice versa (p. 331) so that the representation of the curve is equivalent to 
a parametric representation in polar coordinates. 


4. Implicit representation 
We have occasionally seen this type of curve representation. There is no 
coordinate explicitly shown in this representation. The curve’s tangent is 
obtained by implicit differentiation, but will not be discussed in detail. As 
an example, we use once again the circle (radius r, center M(m,z/my)) 


(x@— mz)? + (y- my)? =r? 


7.3 Parametrization of curves 327 


a N 
Fig. 7.36 superellipses with respect to superhyperbolas 


or — as a generalization — the so-called superellipses 


(x/a)" + (y/b)" = 1. 


Among them we find, for n = 2, an ordinary ellipse with axis lengths 2a 
and 2b. On the left-hand side of Fig. 7.36, such curves are displayed for 
n=1,1.5,2,4,8,16. If we modify the equation to (x/a)” — (y/b)” = 1, we 
obtain “superhyperbolas” (Fig. 7.36, right). 


>>> Application: course angle of a logarithmic spiral 
Prove Formula (7.22) and show the following: The “course angle” of the lo- 
garithmic spiral is constant (Fig. 7.37). 


Fig. 7.37 constant course angle of a logarithmic spiral (right: in space) 


Proof: We show Formula (7.22) using Fig. 7.38, middle: For dy > 0, we obviously 
have 
rdp_ or _ r(y) 
dr dry) 

dy O 
For the logarithmic spiral given by Formula (7.23), the following holds 


tanw = 


1 
Hp) =kroe*’? = kr(y) = o = arctan re constant. 


® Remark: During the day, butterflies navigate with the Sun. At night, they use 
the Moon for orientation. In both cases, they point one of their ommatidia (conical 
segment of the compound eye) towards a light source “at infinity” (Fig. 7.39, left). 
The light rays are parallel to each other so that butterflies “only need to flutter” in 
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Fig. 7.38 for the proof Fig. 7.39 a butterfly and a chameleon 


order to fly straight. 

If moths meet an artificial light source (perhaps even fire), then — as they attempt to 
pass straight — they fly towards the light source on a trajectory that approximates 
a logarithmic spiral due to the constant course angle. 

It is also due to the constant course angle that the chameleon rolls its tail (Fig. 7.39, 
right). The vortex lengths behave in a reasonably “exponential” manner. A snake, 
on the other hand, winds itself into an Archimedean spiral, because its (reasonably 
constant) body diameter is added up with every turn (see Fig. 7.35). Viewed in this 
manner, a record groove is also an Archimedean spiral. @ <<< 


Fig. 7.40 Different spirals: an Archimedian spiral (left), a logarithmic spiral (midd- 
le), and several logarithmic spirals that produce a visual effect when the image is 
rotated (right). 


>>» Application: How to distinguish spirals? (Fig. 7.40) 

Spirals are fascinating for humans. Therefore, they often appear in daily life. 
Although there are many different kinds of spirals, the two “classics” appear 
more frequently. How can we see the difference quickly? 


Solution: 

The following criterion allows us to distinguish between the Archemedian 
spiral and the logarithmic spiral: The former has constant distance between 
subsequent windings. In the latter, the distance between subsequent windings 
grows exponentially. 
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® Remark: If we rotate the circular disc in Fig. 7.40 (right), we will immediately 
have the impression that we are either “kicked out of” or “sucked into” the image, 
depending on the way we rotate (clockwise or counter-clockwise). ® daa 


>>> Application: snail shell (Fig. 7.41, Fig. 7.42) 
Why are the contours (seen from the the top view) of snail shells (mussel 
shells, nautilus shells) logarithmic spirals? 


Fig. 7.41 middle: fresh water snails mating; left and right: digital model. (The 
interior of a snail shell can be quite complicated.) 


Fig. 7.42 clam and a digitalized model 


Solution: 
Obviously, snails have the following “growth strategy”: 


e The animal grows “around an axis” (apparently to remain compact). 
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e The magnification dr of the distance to the axis is directly proportional 
to the previous maximum distance r (proportionality factor k). 


e The increase dy of the rotation angle is proportional to the increase in 
distance dr. 


Fig. 7.43 200 million years old ammonites: spinning to the left or to the right? 


“Viewed from above’, i.e. in the direction of the spiral axis (Fig. 7.37, right), 
we get the contour of the snail shell and thus obtain the relation 


dr=krdy. 


We have just noted (in Application p. 326) the following property of the 
logarithmic spiral: 


Ce as ee 


If the growth follows the mentioned principles, intermediate irregularities can 
occur without changing the overall geometric shape. If the snail finds more 
food during the season, it will grow faster. Otherwise, it can take a break in 
between growth phases, without this becoming visible at a later stage. 


® Remark: We have just solved a “differential equation” by means of a comparison 
The example could, therefore, be given in the next chapter, where we provide general 
solution strategies for simple differential equations. ® 


® Remark: Snail shells or mussel shells that “spread to three dimensions” can be 
either left- or right-handed. Virtually, almost all snail shells wind in the clockwise 
direction (mathematically negative) when viewed from above. This is not the case 
with the “snail king”. It “winds to the right”. For the ammonites in Fig. 7.43 — at 
least at first glance — there is no above and below. Therefore, you cannot say in 
which direction the shell spirals. @ <<< 


>>> Application: Stay the course! 
What happens if you travel long distances by plane or boat with a constant 
course angle (see also Application p. 547)? 
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‘a 


Fig. 7.44 odyssey on the sphere: with compass (left) and without (right) 


Solution: 

One does not travel on a great circle (which would be the shortest route from 
start to finish, Application p. 361), but starts — like a moth cruising around 
a light source — around one of the poles in spiral form. The exact solution is 
obtained by stereographic projection from the South Pole to the tangent plane 
at the North Pole (Fig. 7.44, left). This projection is conformal, as is shown, 
for example, in Geometry and its Applications in Art, Nature and Technology. 
The meridians of the sphere are mapped to straight rays through the North 
Pole in the tangent plane. The constant rate angle produces a logarithmic 
spiral. Projected back onto the sphere, one obtains a lorodrome on the sphere. 


® Remark: Exceptions: If you drive precisely towards the north or south, you move 
on a meridian. If you drive east or west, then you move on a circle of latitude. In 
polar regions, the latter means a pretty big detour. You are probably familiar with 
the following riddle: A bear marches 10 km to the south, 10 km to the west, and 
finally 10 km to the north. At the end of its journey, the bear realizes that it has 
arrived at the starting point. What color is the bear? For the solution, see Fig. 
2.16. ® <<< 


Converting Cartesian coordinates into polar coordinates and back 
1. Let the point P(a/y) (Fig. 7.45) be given. Then, its distance from the 


origin is 
r=Var+y? 


and the course angle is computed from the coordinates, with their signs 
taken into account, as 


y sign x: arctan is if x +0, 
tany=— - 
x 


> p= 


sign y - i if #=0. 
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2. Let the point P be given in polar coordinates P(r; y). Then, its Cartesian 
coordinates are 


x=rcosy, y=rsiny. 


- 


Fig. 7.45 polar coordinates Fig. 7.46 bees and polar coordinates 


® Remark: Bees and polar coordinates: Through their performance, dancing bees 
communicate the location of a food source to their sisters (Fig. 7.46). The central 
axis of their dance shows the direction (the polar angle y to the base point of the 
Sun) and the frequency of the waggle motion along these lines serves as a measure 
for the distance r of the food source. @ 


Conic Sections 


In addition to straight lines and circles, the most important curves are conics. 
These are curves of degree 2, i.e. they have a maximum of two points of 
intersection with a generic straight line. If the intersection with a tangent line 
is counted twice, and if you further admit “complex” solutions (Section B), a 
conic always has exactly two points of intersections with a straight line. 


Fig. 7.47 Diirer’s constructions of conic sections. These famous drawings in his 
book Underweysung der Messung, mit dem Zirckel und Richtscheyt, in Linien, Ebe- 
nen unnd gantzen corporen, from 1525 lack a little bit of symmetry. 
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Fig. 7.48 conics as planar intersections of cones of revolution 


The name conic already gives a hint: It is the planar intersection curve of a 
cone of revolution. These curves include the ellipse (the circle is a special case 
of it), the parabola, and the hyperbola. The decisive factor is the location of 
the intersecting plane. Let us imagine the cone of revolution with a vertical 
axis so that all generators (and also all tangent planes) form the same angle 
of inclination with the horizontal base plane 6 (Fig. 7.48). The cone is “infi- 
nitely long” in both directions. Then, all planes o that are less inclined than 
the tangent planes intersect along ellipses, and all steeper planes intersect 
along hyperbolas. In the limit, i.e. between ellipses and hyperbolas, we will 
find parabolas. Since antiquity, conic sections have been of great interest in 
mathematics and related disciplines. Therefore, one could give a whole lec- 
ture on them. Here, we will cite only some important geometrical properties, 
especially those which are common to all conics. 


e Ellipses and hyperbolas have two focal points, whereas parabolas have 
only one. These focal points are the points of contact with those two 
spheres that are inscribed into the cone and touch the plane o. The lat- 
ter spheres are called the Dandelin spheres after their discoverer (G.P. 
Dandelin, 1794-1847). The Dandelin spheres enable us to show that, for 
the points on an ellipse/hyperbola, the sum/difference (absolute value) of 
the distances to the focal points is constant. We will take a closer look at 
the special case of the parabola because of its great technical importance 
(Application p. 342). 
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e The parallel projection of a conic to a fixed plane is again a conic of the 
same affine type. That is, an ellipse is mapped to an ellipse, a hyperbola 
is mapped to a hyperbola, a parabola is mapped to a parabola with their 
known focal properties. Accordingly, the “top view” — the view from above 
or below — is again a conic of the same type. The top view of the cone’s 
tip is one of the two foci. 


For the sake of simplicity, let us think about the cone whose genera- 
tors/tangent planes are inclined under 45° (as shown in Fig. 7.48). We 
are only interested in the top view. The generators e have the parametric 
representation % = r (cosy, siny,-1)”. Here, r is the radial distance from 
the vertical axis, and y is the rotation angle. The plane of intersection 
with the inclination angle 8 through the point (0/0/-.s) has the equation 
a: €x-z=8 (with c = tan). The intersection eno results in the relation 


——_— (7.25) 
l+ecosy 

which is a representation of the intersection curve in polar coordinates. The 

characteristic value ¢ = tan £ is called the numerical eccentricity. For ¢ = 0, 

there is obviously a circle of radius ro. For 0 < |e| < 1, the denominator 

will never be 0. So, we are dealing with an ellipse. For |e| = 1, we get a 

parabola. For |e| > 1, we get a hyperbola. 


Fig. 7.49 Any kind of conic can be the perspective image of a circle. 


e The central projection (projection from a fixed point to a fixed plane) of a 
conic may change the type, but the result is always a conic. For example, 
an ellipse (the special case of a circle) can be mapped to a hyperbola. 
This occurs in photography over and over again, because there is a central 
projection from the optical center of the lens onto a photosensitive film 
plane. 
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>>> Application: vertices and foci of conics 


Find the coordinates of the principal vertices, the center, and the focal points 
TO 

1+ecos 

2a of the principal vertices or the distance 2e between the focal points. 


of the conic r(y) = . Further, calculate ¢ and ro for a given distance 


Solution: 
The first focal point F) is the origin of the coordinate system. The principal 


um Thus, the first 
lt+e 


principal vertex has the coordinates A (“ / 0). and the second principal 
+€ 


vertices occur for y = 0 and y = zm. We obtain r(0) = 


vertex has the coordinates B ( i 0). Therefore, the length of the major 
-€ 


axis is 


AB =2a= e f a 
lte 


The center is given by 


Greco eed ear) 


—_—_ TOE 
Its x-coordinate e = MF = F . 


| is referred to as the linear eccentricity. 


2 
€ 
The second focal point Fh is symmetric to F, with respect to M, and thus, 
2ro é 
- e -1 / 0) 
Now let 2a and 2e be given. Then, we find 
TOE 
2 
ey fe To ie e, and subsequently, rg = a(1- 7). 
a ddd 
1l-e? 


>>> Application: locating a sound source 

At three locations F,, F2, and F3 whose relative positions are known, one 
hears an explosion at the respective times t1, t2, and tz3. Where did the 
explosion take place? 

Solution: 

Let 5 be the position at which the explosion has occurred, and let s; = SF, 
and s9 = SF. Let c be the speed of sound at the given time, then, sz — s1 = 
c(t2-t,). Thus, S' is located on a hyperbola with foci F, and Fy (2e = 2F, F) 
and 2a = c(t2 — t1). Its equation reads 


r(y) = ie with ¢= = and ro =a(1 meh 
l+ecosy a 
Analogously, S' lies on a (homofocal) hyperbola with foci F, and F3 and the 


equation 
TO 


1+ecos(y+e) 


r(y) = with adjusted a or e. 
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Here, € is the angle at F, in the triangle F, F2F3. The intersection of these 
homofocal conics leads to a quadratic equation (Application p. 144) for cosy. 
So, there are two solutions for y. The right solution can be seen by testing 
the values. 


® Remark: A slightly modified method — namely by introducing spherical coordina- 
tes — is used to determine the epicenter of an earthquake. Before the introduction 
of GPS, it was also the prevalent method for the determination of a ship’s position 
at sea. ® << 


Among the remarkable physical properties of conic sections, we will mention 
only Kepler’s First Law: 


The relative orbits of planets or asteroids and comets around the Sun are 
ellipses, and the Sun is one of the two foci. 


The proof of this is cumbersome and it took the genius of Newton to give. 
It is given in detail in G. Glaeser, H. Stachel, B. Odehnal: The Universe 
of Conics. From the ancient Greeks to 21st century developments. Springer 
Spektrum, 2016, pp. 62-76. 


Fig. 7.50 The elliptical orbits of the planets of our solar system lie nearly in one 
plane. Unfortunately, this also holds true for asteroids (small rocks) and comets. 


>>> Application: The comet is coming! (Fig. 7.50) 

For millennia, mankind has trembled at the thought of a comet hitting their 
planet, which could destroy all life. In fact, 65 million years ago, a comet 
caused an appalling devastation, wiping out the formerly successful kingdom 
of dinosaurs. There are always potentially dangerous situations even if the 
probability of the occurrence of a disaster in the “short term” — that is, 
approximately in the next 10,000 years — is quite low. 
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va 


The Earth’s elliptical orbit is given by the equation r = . A comet 


I 
1+ gp cosp 
r2 
1+0.7 cos(y +06) 
is the constant twisting angle of both ellipse’s principal axes. Where are the 


points of intersection and hence the potential collision points of both conics? 


moves in the same plane on the elliptical orbit r = Here, 6 


Solution: 
By equating the radii, we obtain 


ry r2 


1+g, cosp 140.7 cos(y+d) 


We have already solved this equation in Application p. 144. 


® Remark: The planetary orbits do not intersect each other. So, no collision of 
planets can ever occur. ® <<< 


® Remark: Shooting stars (meteorites) are completely harmless, only haven a dia- 
meter of between 1 to 10 mm. Their appearance varies according to the time of day 
and the time of year: 

First of all, the greatest amounts of meteorites can be seen just before sunrise. If we 
are on the Earth, we move on a circle of latitude because of the Earth’s rotation. 
At the same time, the Earth moves in a direction that is perpendicular to the di- 
rection of the Sun, because the orbit is almost circular. Now, let us assume that the 
Earth travels into a swarm of meteorites. The points whose latitude coincides with 
the breaking of dawn will be most heavily bombarded by the “shower of particles”. 
Points on the opposite side are shielded. They are only caught by particles which 
are about to “overtake” the Earth. 

Furthermore, the Earth always collides with a heap of meteorites in late July/early 
August: In this period, the Earth crosses the meteor shower of the Perseids, which 
is a tube-like elliptical zone containing a swarm of small particles. The particles 
come mainly from the direction that runs opposite to the direction of the Earth’s 
trajectory, and according to the rules of perspective, many shooting stars will ap- 
pear to exit from the same point in the firmament (the common ideal point of the 
proceeding direction). ® 


The theory of space curves 


Planar curves have tangents and normals. If the curve does not lie in a plane, 
then the whole issue becomes more demanding. The term tangent does not 
change its meaning. The curve normal, however, is no longer uniquely defined: 
Each normal to the tangent through the point of contact is a curve normal, 
and lies in the normal plane v of the curve tangent. 

In space, we can also consider a curve point P and two neighboring points 
Q and R. If the three points do not lie on a straight line, they define a 
circle. Now, we let Q and R move towards P. The circumcircle of the three 
points P, Q, and R reaches a limit position, called the osculating circle of 
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Fig. 7.51 osculating circles of a space curve 


the curve at P. Its carrier plane is called the osculating plane. We say: The 
osculating plane shares three (infinitely close) points with the curve at P. In 
Application p. 339, it is shown how to determine the osculating plane of a 
parametrized curve Z(u) via the cross product %(u) x Z(u) of the first and 
the second derivative of the curve. 


Fig. 7.52 accompanying tripod (also called Frenet frame) of a space curve 


Among all normals of the curve at the point P, we can now highlight two 
normals in particular: The first one lies in the osculating plane and is called 
the principal normal. It carries the center of the osculating circle. The second 
one is perpendicular to the osculating plane and is, therefore, located in the 
normal plane. It is called the binormal of the curve at P. The tangent, the 
normal, and the binormal form the curve’s accompanying tripod. The plane 
spanned by the tangent and the binormal is called the rectifying plane. 

The curve’s orthogonal projection onto the osculating plane o at P is a planar 
curve that shares the osculating circle at P’ = P with the space curve (Fig. 
7.53, left). 

The curve’s orthogonal projection onto the rectifying plane @ has (in general) 
a point of inflection at P’ = P (the osculating circle appears as a straight 
line, Fig. 7.53, middle). If P is a so-called handle point — which may be found 
in asymmetry plane of the curve — a flat point on the image curve shows up. 
When we finally project orthogonally onto the normal plane v (Fig. 7.53, 
right), a cusp P’” = P occurs because the curve tangent appears as a point. 
Next to the curvature, geometry also employs the concept of torsion of a 
curve (Fig. 7.54). 
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Fig. 7.53 special projections of a curve 


\* ik &. id 7 = 44,9 j = 


Fig. 7.54 torsion of a space curve and something akin to an accompanying tripod 


Circles (viewed as space curves) have constant curvature and torsion. Howe- 
ver, the torsion equals zero all over the curve. Helices as “generalized circles” 
also have constant curvature and constant non-zero torsion. 

The torsion at a curve point is a measure of the osculating plane’s winding 
about the curve tangent (by analogy, the curvature of a planar curve which 
measures the rotation of the tangent about the center of the osculating circle). 
A curve with zero torsion lies in a fixed plane which is its osculating plane (by 
analogy, a planar curve without curvature stays in the direction of a straight 
line, which is its tangent). 


>>> Application: osculating plane of a space curve 

In Application p. 358, we will need to determine the rectifying plane of a 
point of a space curve. It contains the tangent of the curve and is perpendi- 
cular to the osculating plane. Prove that if (wu) is a parametrized curve, the 
osculating plane (uo) in u = uo is spanned by the derivation vectors a9 and 
Zo. Thus, its normal is given by %(ug) x %(uo). 

Proof: Let Z(u) be a parametrization of a curve whose coordinate functions can 
be expanded in Taylor series. The osculating plane o at a point Zo = Z(uo) is by 
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definition spanned by the point and two “neighboring points”: 


where we have used the Taylor expansion and 73 indicates terms of degree 3 or 
higher. The normal n of the plane o spanned by 29, %}, and Zp is parallel to 


Hl é # 
n= (%5 — Xo) x (% = Zo) = a hk(h+ k) xo x Zo +74 


with 7, indicating terms of degree 4 and higher. Obviously, the normal n is parallel 
to Zo x Zo. 

In order to perform the limit procedure, we cancel the maximum power of h and k, 
i.e., the factor $hk(h+k). Then, after h > 0 and k > 0, we see that the limit of the 
normal n is parallel to % x %. Hence, the limit of the plane through three sufficiently 
close points on the curve contains both the first and the second derivative of the 
curve. © daa 


>>> Application: more surfaces determined by a space curve 


Fig. 7.55 All osculating circles of a spatial curve and their axes form a surface. 


There are more “accompanying surfaces” like the rectifying developable (Ap- 
plication p. 358) which are determined solely by the space curve. E.g., one 
can consider the circular surface that consists of all the osculating circles 
(Fig. 7.55, left). Furthermore, all the axes of these circles form a developable 
surface called the polar developable, see Fig. 7.55, right. <<< 
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7.4 Envelopes 


If a straight line g moves, it envelopes a curve. The simplest case is that of 
a line rotating about a point MM. Then, it envelopes a circle with center M 
and radius r = Mg. In general, the following theorem holds: 


Assume that we are given the equation of a line dependent on some parameter 
t (for example, time): a(t) a + b(t) y = c(t). Then, one obtains the point of 
contact with the envelope c by intersecting the straight line with its “derivative 
line” a(t) a + b(t) y = c(t). 


Proof: Let ¢(t) = ) be a parametrization of the enveloped curve c. Then, its 


tangent at the point C(u(t)/v(t)) reads 
=> u(t) u(t) 
seo (40) a(20) 


i(t) 
—u(t) 


Multiplication by the normal vector ( yields the non-parametric representa- 


tion 
ve-uy=vU-UD. 


The equation of the tangent describes the line g. We differentiate the equation with 
respect to ¢ and obtain again a straight line (the “derivative of the line” g): 


ba-ty=vutou-(tivtud) =tu-wv. 


Apparently, « = u and y = v satisfy both linear equations, and therefore, C(u/v) is 
the point of intersection of the lines. ©) 


An envelope of lines is often visible in specular cups and is called a focal 
curve (Fig. 7.56; Fig. 6.47, right). If the cup is a cylinder of revolution, a 
pure reflection can be observed in the top view (Fig. 4.55). 


Fig. 7.56 envelopes in a cooking pot Fig. 7.57 a parabola as an envelope of a 
(two symmetric light sources) right-angle hook 
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>>» Application: a parabola as an envelope (Fig. 7.57) 

A right-angle hook is placed in such a way that the apex lies on the y-axis 
and one of the legs goes through a fixed point F(a/0). Calculate the equation 
of the envelope of the second leg. 


Solution: 


Let $(0/t) be the apex of the right-angle hook. Then, 7 = SF = e is the 


normal vector of the line g of the second leg. Consequently, 


» (}#-(3)(ornan 


By differentiating with respect to t, we obtain the derivative of the line 
g: -y=-2t> y= 2t. 


We intersect the two lines g and g by substituting y = 2¢ in the equation of 
g: 

12 
ax—t(2t)=-P >a= —. 
a 


A parametrization and an equation of the envelope are, thus, 


‘s 2 
r=—, y=2t or yo =4az. 
a 
They describe a parabola. The y-axis is tangent at the vertex of the parabola 
which is the origin O of the coordinate system. F' is the focal point. 


Fig. 7.58 parabolic headlights Fig. 7.59 parabolic cylinder 


Let C, be the projection of the parabola’s point C onto the x-axis (Fig. 7.57) 
and Cy the projection onto the y-axis. The main properties of the parabola 
can directly be derived from y = 2t > OS = SCy: 


1. If one reflects F’ in the tangent, the “counter-point” F™ is always on x = —a 
which is parallel to the y-axis (the “directrix” 1). Because of the mirror 
property given by the “classical parabola definition”, we clearly see: 


CF =CF*. 
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Thus, the parabola is the locus of all points equidistant to a fixed point 
F and a fixed line J. 


2. The focal rays FC are reflected in the parabola and emerge parallel to 
the axis (also as a result of reflection). This lovely property is utilized for 
headlights (Fig. 7.58). Conversely, incoming rays that are parallel to the 
axis are reflected to the focal point, which, for example, is utilized in radio 
telescopes and “satellite receivers” (Fig. 7.59). 


3. The subnormal CN has a constant length of 2a. 


4. The subtangent CT is halved by the vertex. Similarly, the tangent section 
CT is bisected by the vertex tangent (TS = CS). 


>>> Application: parabolic headlights and sun collectors 

By positioning a point light source at the focal point of a reflecting paraboloid 
of revolution, it can be ensured that all light rays that hit the mirror will be 
reflected parallel to the axis. Such lights are called “long distance beams”. In 


practice, rod-shaped light sources are often used. 


= 4 “Sag, 


Fig. 7.60 parabolic screen in an east-west-direction 


Parabolic cylinders are utilized for such cases, which are also useful for focu- 
sing sunlight on a pipe. This pipe may contain liquid (e.g., thermo oil) which 
is first heated and then diverted. The screen is oriented in an east-west di- 
rection and is constantly moved to ensure that the sun stays in its symmetry 
plane. <<< 


>>» Application: the envelope of a straight line in an ellipse motion (Fig. 
7.61) 

A rod XY with constant length is guided so that X moves along the z-axis 
and Y along the y-axis. Calculate the curve enveloped by the rod (demo 
program astroid. exe). 
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Fig. 7.61 an astroid as an envelope Fig. 7.62 space requirement of the 
connecting rod 


Solution: 
Let a= XY be the length of the rod and y the angle enclosed with the x-axis. 
Then, we get X(a cosy/0) and Y(0/a sing). Substituting these coordinate 
pairs proves that the carrier line g = XY is described by 

x y 


=1. 
acosp asing 


We multiply by the common denominator and arrive at 
g: sngx+cosyy=asinycosy. 


This time, of course, y is the parameter. If we differentiate with respect to 
vy, we obtain the derivative of the line 


g: cosy2-sinyy =alcos ycosy + siny (-siny) |. 


In order to calculate the intersection point C' = gng — coincidentally, the two 
direction vectors of g and g are always perpendicular to each other because 
their dot product always disappears (Fig. 7.61) — we multiply the equation 
of g by sing and that of g by cosy and add both equations (thus, y is 
eliminated): 


(sin? y + cos” ~) x = a[sin” y cos y + cos? y - sin” y cosy]. 
Since sin? y + cos” y = 1, we obtain x = acos*y. Analogously, we find y = 
a sin? y. The thus described curve is known as an astroid. 


v : y a : 
® Remark: From cosy = f= or siny = ff we can eliminate y because sin? y + 
a a 


cos” y = 1, and we get 


Va? + ¥/y? = Va? = constant. 


Cubing twice will then show that the astroid is a curve of degree 6. © <<< 
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The envelopes of straight lines are frequently needed in order to determine 
space requirements of mechanisms (Fig. 7.62). Nowadays, computer anima- 
tions are used for the development of transmissions. However, when using 
such animations, a helpful parameter representation of the moving line g(t) 
is rarely available. In such a case, one can try the approximation with two 
“neighboring lines” g(t—¢) and g(t+e) — with a “very small <” (with regard 
to numerical instabilities, the chosen ¢ should not be too small; experience 
shows that ¢ = 1074 is a good value) — and so we get the following statement: 


The intersection of two “neighboring straight lines’ also provides an approxi- 
mation of the point of contact with the envelope. 


Proof: The statement is obviously true for an ordinary rotation around a fixed 
point. Following the main theorem of planar kinematics, even the most complica- 
ted motion can be interpreted as an infinitesimal rotation (or translation) at each 
instant. © 

The statement above applies not only to envelopes of straight lines, but quite 
generally to any envelope of curves (contact is only related to the tangent 
and not to the curvature). So, you only have to intersect two “neighboring 
curves”, and this will yield the points of the envelope. 


>>» Application: envelope of circles (Fig. 7.63) 
Without using differential calculus, we will investigate the envelope(s) of a 
circle k whose center M moves along a curve m (for example, an ellipse). 


Solution: 

One intersects pairs of “neighboring circles”, which always provides two points 
on the normal to m at M. Properly connected, this results in two curves. If 
m is an ellipse, the outer branch is always oval and approximately elliptical, 
the inner branch — with a sufficiently large radius of k — may have cusps. 
Fig. 7.63 can be interpreted spatially as the silhouette of a torus under normal 
projection. <<< 


Fig. 7.63 envelope of circles Fig. 7.64 Wankel engine 


>>» Application: envelope in the Wankel engine (Fig. 7.64) 
One practical application of an envelope shows up in the Wankel engine. It is 
basically an equilateral triangle (the core of the piston) that moves through 
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gears (and is, thereby, turned around within the piston chamber outlining a 
trochoid). 

The enveloped curves of the triangle’s sides fit without difficulty into the 
piston chamber. For the purpose of high-compression (and simultaneously 
better sealing), the center piece can be “thickened” as long as the envelope 
has just enough space in the piston chamber. In Fig. 7.64, this is done by 
means of circular arcs (in practice, one has opted for a slightly deviating form 
in order to create a more compressing profile). << 


>>> Application: stability of a ship (Fig. 7.67) 

A ship is sinking into the water due to its own weight. One might be tempted 
to think that the exclusive criterion for the stability of the ship in a heavy 
swell is that the center of gravity should be as low as possible. A low center 
of gravity is desirable, but the decisive factor is the position of the so-called 
“metacenter” M — which is the cusp of all the “buoyancy lines” of the envelope 
(the buoyancy line is the vertical line through the center of gravity of the 
displaced water; relative to the pitching and tossing ship this line is, then, 
of course no longer vertical). The location of the metacenter M is decisively 
influenced by the hull’s shape. The higher M is above the center of gravity G 
of the ship’s profile, the more stability it provides to the ship. Explain why 
this is the case. 


; = » 5) VS 
Sa hi = 


Fig. 7.65 the inventor him- Fig. 7.66 metacenter for an unfavorable cross-sec- 
self — Eureka! tion — the center of gravity must then be lower 


Solution: 

In Application p. 218, we can see an inclined position of the hull. For this 
position, the center of gravity of the displaced water can be calculated. The 
buoyant force acts upwards in this case. In order for the occurring torque to 
raise the ship, the “buoyancy line” a must pass the center of gravity G under- 
neath. The uplifting torque is proportional to the normal distance aG. Now, 
we consider a series of successive slanting positions and plot the buoyancy 
line in the “cross-section”. Then, we get a series of straight lines that envelope 
a curve. This “metacurve” has an apex on the line of symmetry of the cross- 
section. Since the buoyancy line in a non-inclined position coincides with the 
line of symmetry, the curve has a cusp (the metacenter M) at this point. 
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According to the aforementioned considerations, M has to lie above G. In 
fact, it should lie as far above G as possible so that for all curve tangents — 
up to a maximum tilt angle — G is always on the right side. 


Fig. 7.67 the metacenter of a ship in a typical situation 


® Remark: Nowadays, sailing vessels are constructed so that they set up right on 
their own. This is achieved by ballasting the bottom with plumb. ® <<< 


>>» Application: hazardous profiles (Fig. 7.66) 

If the metacenter (Application p. 346) is under the center of gravity, then 
the ship sinks under unfavorable conditions (especially if high waves bring 
more and more water into the hull). 
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Fig. 7.68 The metacenter changes due to water penetration. 


® Remark: There are countless reports on the sinking of supposedly unsinkable ships. 
One of them concerns the ferry “Estonia”, which capsized near the Polish coast on 
September 28th, 1994, which led to the drowning of 852 people in icy water. By 
means of a lifelike replica model, the shipyard wanted to prove that the sinking was 
caused by poor maintenance and later modifications. However, this is not completely 
true because the simulation of real conditions at sea is still difficult. On the website 
that accompanies this book you will find the demo program stormy_ocean.exe. 
You can select different profiles and simulate a stormy sea. It will forecast whether 
the selected object will sink or not. This is performed by means of a relatively 
complex calculation of metacenters. ® <<< 
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Evolutes 


Differential geometry deals with the limits of functions. For instance, if one 
considers two points P and Q of a planar curve, then its connection line forms 
a secant (lat. secare = to intersect). If Q now moves increasingly closer to P, 
then there exists an unambiguous limiting position which is called tangent to 
the curve (lat. tangere = to touch). One simply says: At the point of contact 
P, the tangent of a curve and the curve share two points which are infinitely 
close together. The normal to the tangent at P is called the curve normal. If 
the curve is a circle, all curve normals pass through the center. However, in 
general, the totality of normals to the curve envelops a curve — the so-called 
evolute. 


Fig. 7.69 tangent, normal, evolute of different curves 


We can also consider two neighboring points Q and RF to the curve point P. 
If the three points do not lie on a straight line, they define a circle. We now 
let Q and R move closer to P. This leads to a well-defined limiting position 
of the circle defined by three points — the osculating circle at the point P 
(a circle can also degenerate into a straight line or a point). The radius of 
the osculating circle is called the radius of curvature. In the vicinity of the 
curve point, it approximates the curve much better than the tangent. The 
osculating circle at P is said to share three points with the curve. 


The locus of the midpoints of all osculating circles is the evolute enveloped by 
the curve normals. 


Proof: The osculating circle is determined by three neighboring points P, Q, and R. 
Its center is, therefore, located at the intersection of the perpendicular bisectors of 
PQ and PR. Let Q* be the midpoint of PQ and R* that of PR. The perpendicular 
bisectors then converge towards the curve normal at Q* and R*. The circumcenter of 
PQR, thus, converges towards the intersection of two adjacent curve normals. @) 


>>> Application: a circle as an evolute 
Fig. 7.69 shows curves with their evolutes. One can see that evolutes have 
cusps whenever the curve has an apex — a point whose radius of curvature is 
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stationary. The right curve is especially notable. It is constructed in such a 
way that its evolute is a circle. It is, thus, called the circle’s involute. <<< 


>>> Application: A curve can be its own evolute. 

In Fig. 7.70, one can see the logarithmic spirals with their evolutes. Since the 
normals of the original spiral are tangents of the evolutes, the evolute also 
intersects a pencil of rays at a constant angle w. Its equation in polar coor- 
dinates (r,y) is r = e?”. The constant parameter p = cot 7 determines how 
fast the logarithmic spiral wraps around its asymptotic point. The evolute 
of such a curve is another logarithmic spiral that is congruent to the initial 
curve if p is chosen appropriately. In special cases, it can even be equal to 
the initial curve (right image). <<< 
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Fig. 7.70 logarithmic spirals with their evolutes 
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7.5 Surfaces 


Fig. 7.71 A regular surface (computer-generated), and photos from a corresponding 
3D-model, which was created by a 3D-Printer. A “mathematical surface” (given by 
parametric equations) has, of course, no thickness. 


When a curve c moves in space, it traces a surface that is generally “curved 
twice”. This also occurs when the “generating curve” is a straight line. In this 
case, one speaks of a ruled surface (Fig. 7.71, left: “rotoidal helicoid”, see also 
Fig. 7.10). The generating curve may change its shape during the course of 
the motion (Fig. 7.72: “Escher torus”, Fig. 7.73). 


Fig. 7.72 an “Escher torus” in two different views 


The generating curves c may be circles, parts of circles, or other curves. Fig. 
7.73 shows such surfaces as parts of snail shells. 

If all circles are of equal size and it is also possible to roll a sphere such that 
it envelopes the same surface that is generated by the congruent circles, we 
obtain a pipe surface (Fig. 7.74 “feng shui spiral” and sculpture in Berlin). 
Among the pipe surfaces, we find the well-known example of the cylinder of 
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Fig. 7.73 spiral surfaces in nature 


revolution, the sphere, and the torus (Fig. 8.40). Toric parts occur very often 
in practice (Fig. 7.75, Application p. 351). 


Fig. 7.74 pipe surfaces 


>>> Application: boundary surfaces of technical objects 
Describe the boundary surfaces of the object depicted in Fig. 7.75. 


Solution: 
The bearing retainer is composed of the following parts: 


e The main part consists of two hollow cylinders Z, and Z2, which are joined 
by a quarter of a torus T (a section is cut out for a better view into the 
interior). 


e The clamps are made of prismatic parts P (i.e. the parts are bounded by 
planes) which are joined by a half-cylinder 73 and a quarter-cylinder 24. 


The universal transmits a rotation about one axis to another axis in a uniform 
manner (this is an improvement compared to the universal, see Application 
p. 219 and Application p. 303). It is composed of the following parts: 


e Two axes a; and ag, materialized by solid cylinders which are rigidly 
connected with two spherical parts 4; and Ng. In the figure, the axis a1 
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Fig. 7.75 torus at a bearing retainer and a universal joint 


and the part of the attached hollow sphere 1 are cut in half, giving us a 
view of the interior of the joint. 


Between the outer hollow sphere /; and the inner spherical segment ‘No, 
there is a thin spherical layer 3 that is mounted and fixed on ¥}. It has 
circular holes in which the spheres 44 can roll. 


These rolling spheres can move both on the inner wall of 4; and on the 
outer wall of Xs in toroidal tracks JT; and Ty. This enables a constant 
transmission of rotation at any adjustable angle. bib 


Surfaces of revolution are formed when a curve c rotates about a fixed axis. 
The curve does not need to lie on a “meridian plane” through the axis, alt- 
hough this would make it easier to imagine the shape of the surface. However, 
one can rotate the points of c into such a meridian plane. The resulting curve 
co generates the same surface (Fig. 7.78) during rotation. 


Fig. 7.76 when a jet of water “severs” ... 


When a jet of water flows from a faucet, it finally “breaks away” due to the 
increasing rate of the descending water (see Application p. 368) and forms 
rotation-surface-like drops within fractions of a millisecond. The only shape 
that you will look for in vein is the classic drop or teardrop shape that tapers 
downwards while pointing upwards. 

Helical surfaces arise when a curve undergoes a helical motion about an axis. 
Again, there are, of course, different types of generation of these surfaces. In- 
tuitively, the surface is best understood when a profile section (perpendicular 
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Fig. 7.77 dew drops descending slowly from a tree bark 


to the axis) and/or a meridian section (planar intersection through the axis) 
is known. Fig. 7.79 shows a helical pipe surface. On the left, we see how it is 
generated by a circle. The image on the right shows its profile sections. 


Fig. 7.78 surface of revolution generated Fig. 7.79 helical surface: two generations 
in two different ways of one and the same surface 


The various positions and the trajectories of the generating curves are called 
parameter lines. In order to distinguish between the different curves, we call 
them v-lines and u-lines: When the v-lines of the motion parameter is set to 
v = vg, we obtain the position of the curve c to the parameter vo. Similarly, 
the paths of the u-lines remain constant on the points of c in which the curve 
parameter is u = Uo. 

The v-lines on ruled surfaces, surfaces of revolution, helical surfaces, and 
spiral surfaces are lines, parallel circles, helices, and cylindro-conical spirals. 
Surfaces can simultaneously belong to more than one of the surface classes 
discussed above. So, a cylinder of revolution, a sphere, and a torus are both 
surfaces of revolution and pipe surfaces. If a surface is formed by screwing 
a sphere (“helical pipe surface”), then it is both a helical surface and a pipe 
surface. In the Baroque period, columns were often made from such surfaces. 
In technology, they are used as chutes (Fig. 4.33). 
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Fig. 7.81 the unfolding, or rather the folding, of a cylinder of revolution (left side), 
“tessellation” (right) 


If the generating curve is a straight line c and if any two “neighboring posi- 
tions” intersect in a point (which may also be at infinity), then the surface 
swept by c is developable. This means it can be unfolded into the plane 
without deformation (Fig. 7.81). There are a few developable surfaces: the 
cylinder (even if it is not a cylinder of revolution), the cone (even if it is not 
a cone of revolution), and the tangent surfaces of space curves. 


>>» Application: from space curves to famous planar curves (Fig. B.5) 
Consider 2D-curves with the parametric equation 


x=asin(t+a), y= bsin(nt+ 8). 


Among them are ellipses (n = 1), eight-loops (n = 2 and n = 1/2), cubic 
parabolas (n = 3 and n = 1/3 with a = 6 = 0), etc. The curves can be 
interpreted as normal projections of space curves which are generated by 
“drawing sine curves on vertical cylinders” (with horizontal axis). For n = 1, 
we have the special case that the whole space curve lies in a plane and is, 
therefore, an ellipse (Fig. 7.81, left). 

A rotation of the carrier cylinder induces a phase shift. 
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ordinary sinusoid (n = 0) a=8=0, n= 3/2 a= 8 =37/4, n= 3/2 
Fig. 7.83 Lissajous curves on an oscilloscope 


All electrical engineers have seen such curves on an oscilloscope (Fig. 7.83), 
when a harmonically alternating voltage is chosen for the entrance of both 
the x- and the y-deviation. In fact, a continuous phase shift creates this exact 
same illusion on the oscilloscope, as if a cylinder of revolution with coiled sine 
curve is rotating. In special cases, the curves are tracked twice for symmetry 
reasons, and they seem to end abruptly because the direction keeps changing. 

ddd 


>>» Application: Mdbius strip (Fig. 7.84) 

In 1865, A.F'. Mébius described a simple method for the creation of a surface 
with only one boundary line and one face. He suggested to take a rectangular 
strip and glue together opposite edges. Ever since, all surfaces of this kind 
have been called “Mobius strips”. The strip inspired M.C. Escher’s famous 
image with the scuttling ants. <<< 


>>» Application: another non-orientable surface (Fig. 7.85) 

Besides the Mobius strip, the Klein bottle is another famous example of 
a non-orientable surface. It has neither an interior nor an exterior. There 
are two Mobius strips on the surface, and along these strips, one can move 
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artwork that was probably inspired by it (“Ferryman” by Tony Cragg) 


continuously in both directions to each point of the surface. On the right- 
hand side of Fig. 7.85, part of such a strip is shown (in green). <<< 


>>> Application: mathematical crochet work (Fig. 7.86, Fig. 7.87) 
Another remarkable surface with self-intersections (in 3-space) was introdu- 
ced in 1901 by Werner Boy. Although it possesses no singularities, it is very 
hard to understand its “inner life”. 


Solution: 
One way to try to solve the mystery is to go through the surface slice by 
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Fig. 7.86 Can you comprehend this surface? 


slice. This technique is proposed in the paper cited below,! which, in turn, 
inspired the artist Lilian Boloney. 


Fig. 7.87 the artist’s approach ... 


Using her crocheting technique, she discovered a “geometric slide rule” that 
can be used to understand arbitrary complicated surfaces. With a soft yarn 
texture, any curvature can be obtained. By stringing together the whole 
surface mesh by mesh, she approximated its final shape. In all likelihood, 
this makes her one of only a few people with a deep understanding of Boy’s 
surface! <<< 


10. Karpenko, W. Li, N. J. Mitra, M. Agrawala: Exploded View Diagrams of Mathematical 
Surfaces IEEE Transactions on Visualization and Computer Graphics archive, Vol. 16 Issue 6, 
Nov. 2010, Pages 1311-1318. 
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7.6 Further applications 


>>> Application: How to create a developable strip? (Fig. 7.88) 

We know that cylinders and cones can be developed, i.e., unfolded into the 
plane without distortions. Let us take a rectangular strip of paper and attach 
the narrow edges to each other. The generated ribbon is, of course, develo- 
pable, since we cannot stretch or compress the paper without tearing it. If 
we do not twist the paper strip, then we get a cylinder. The midline of the 
strip is, thus, its normal section. Twisting produces a developable with the 
midline being a curve in space. The question is now: Is there a “paper strip” 
for an arbitrary space curve c so that c is its midline? This would allow us 
to create paper strips deliberately. 


Fig. 7.88 The rectifying developable — the envelope of the (red) rectifying plane o@ 
— of a space curve can be parametrized so that the whole surface develops into a 
rectangular strip with the midline as the rectified curve. One immediately suspects 
that the surface is developable, since all contour lines are straight. 


Solution: 

A theorem in Differential Geometry says: A surface that is generated as the 
envelope of a moving plane is developable. Let us consider three developables 
that are connected to the space curve: the tangential surface as the envelope 
of all osculating planes, the normal developable as the envelope of all normal 
planes, and the rectifying developable as the envelope of all rectifying planes 
(planes orthogonal to the principal normal). 
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The midline c becomes a straight line when spread out in the plane and is, 
thus, a geodesic curve on the developable. It is known that the osculating 
planes of these are perpendicular to the tangent plane at any point. The 
tangential surface consisting of the osculating planes of the contact curve is, 
thus, not qualified for our purpose. We can see from Fig. 7.55 on the right 
that the normal developable does not even carry the curve. This only leaves 
the rectifying developable, for which all the necessary conditions are met: 
The curve lies on the developable and is also its geodesic line. 

Despite the fact that the development preserves angles, the generatrices of the 
developable do not meet the midline orthogonally. Quite on the contrary: The 
only straight line in the rectifying plane which is orthogonal to the tangent of 
c is the binormal, and the locus of all binormals is not a developable. Thus, 
the method of intersecting two neighboring normal planes of the curve does 
not lead to the solution. In this case, it results in the binormal developable 
which is irrelevant here. <<< 


>>> Application: cutting a pipe connection 

Two water pipes made of stainless steel are shown in (Fig. 7.89), intersecting 
each other. Find a parametric representation of the intersection curve and 
thus the blank of the thin tube. 

@® Remark: The thickness of the steel sheet is to be neglected. Any calculations are 
correct with thin sheets (for example, 2 mm thickness) up to about one millimeter, 
which is sufficient in practice (welded pipes). ® 


Fig. 7.89 pipe joint made of stainless steel 


Solution: 

We identify the axes of two cylinders of revolution ®; and ®; with the y-axis 
and the z-axis. The radii of the cylinders are rj and ro. 

The generator e lying in the yz-plane rotates about the z-axis (rotation angle 
u) and generates the cylinder ®3. The line e is intersected with ®,: 


rg COSU 
e: &-|rgsinu|=0, ©): 2? +2%=r? => en®: (re cosu)?+#? =r. 
t 
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Thus, we have t(u) = +\/r? — r3 cosu? and a parameter representation Z(u) 
of the intersection curve consisting of two branches in general. 

In the normal projections in the direction of the cylinders’ axes, the two 
curve parts appear as circles or circular arcs. In the normal projection onto 
the plane spanned by the axes (the yz-plane), the intersection appears as an 
equilateral hyperbola 


2? —y* = (r? — 12 cosu*) — 2 sinu? = r? - r2, 


“in the front view”, see the middle of (Fig. 7.89). 


When creating the cutting (development or gradation) of ®2, a joint of the 
edge has the arc length b = rgu for the abscissa and the t-value for the 
ordinate. In the Cartesian (b,t)-coordinate system, the bipartite curve of 
intersection (symmetric with respect to the xy-plane) is determined by the 


parametric representation b= rgu, t = 4\/r? — r3 cos? u (0< u< 2r). 


® Remark: If r, is much bigger than r2, the development of the intersection curve 
looks like a sine curve, but differs in the curvatures in the upper and lower vertices. 
The t-values vary between \/r?-r3 and r;. For symmetry reasons, only a part of 
the curve is interesting. 


For r, = r2, the curve splits into two ellipses, and the developable curve into two 
sine curves (Fig. 8.64). ® <<< 


>>> Application: a non-trivial “tessellation” of a triangular rod 

Referring to Fig. 7.90, one takes a sine curve with vertices C, A, D — tem- 
porarily located in one plane — and reflects them in CD (A+ B). The two 
flat sine curves are now projected onto a sinusoidal cylinder with horizontal 
generators parallel to AB. The points C and D remain fixed and the verti- 
ces A and B are “shifted back”. This creates a simply curved papule-leaf-like 
patch that is suitable for “tessellating” a three-sided rod: 


Fig. 7.90 cylinder generated by sinusoids (Talia Nadermann & Sebastian Gomez) 


Reflecting the straight line C'D in the shifted back line AB, one obtains 
the line z which shall serve as the axis of a rotation. By rotating the “papule 
leaf” ABCD twice by +120°, it is possible to make the (blue marked) “leaves” 
touch each other, provided that a suitable choice of AB is given. 
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After rotating this “wreath” by +60° about the axis and translating it along 
the axis by the distance C'D/2, the screwed wreath (marked in red) fits 
seamlessly together with the initial wreath. Through repeated application of 
this helical motion, both aesthetic and very stable structures, like the ones 
shown on the right in Fig. 7.90, can be produced. For representations with 
a computer, parametric descriptions for the leaves’ edges are advantageous. 
Derive the parametrizations and show that these curves are (ordinary planar) 
sinusoids. 

Solution: 

Let a, b, and h be the dimensions of the circumscribed cuboid as in Fig. 7.90 
on the left. In order to bring reflected cylinder parts into contact, b/2: a/2 = 
tan 60° = /3 (= b= V3a) has to hold. A possible parameter representation 
of the symmetric sections of the cylinders is now 


_ fF 43a (1 + cosw) 
Z=|yl=| §(3+cosu) 
vA UL 


For 0 < u < 27, one obtains the section depicted on the left in Fig. 7.90. The 
curves lie in z-parallel planes because «/ V3 = 2y = constant holds (pictured 
bottom left). Thus, the paper edges do not only appear as sinusoids in the 
projection. They are also sinusoidal in three-dimensional space. 


® Remark: At a constant ratio h: a, we get a similar structure. Varying h: a, one 
obtains affinely distorted paper wreaths (see the variants pictured right in Fig. 7.90). 
To calculate the blank (developable) of a “poplar leaf”, we need integral calculus. ® 

ddd 


>>> Application: analysis of the shortest route 

We want to travel along the shortest route with a cruising speed of 800 km/h 
from Vienna (16° E, 48° N) to Los Angeles (135° W, 35° N). (In practice, not 
only the distance, but also the wind directions at an altitude of 10- 11km 
have to be taken into account. In this specific example, the aircraft uses the 
so-called “jet stream” over the Azores on its return flight.) How long is the 
flight path? Where is the airplane after n hours? Where and when does it 
reach the northernmost point? In which direction does it start (apart from 
the predetermined direction of the runway)? 


Solution: 

1. The shortest path on a sphere is a great circle, i.e. a circle whose center is 
the center of the sphere. We calculate the Cartesian coordinates of Vienna 
(P) and Los Angeles (Q). P and Q together with the Earth’s center M (the 
origin) span the plane of the circle. The plane’s normal vector @ specifies the 
parametric representation of the circular orbit. 
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2. We determine the parameter values up and ug corresponding to P and 
Q. The flight distance f and the total flight time are 


f=(ug-up)R  (R=6,380km), or T= oh. 
3. We fly at 800km per hour, or 76-000 = a of the entire circumference of the 


Earth. This corresponds to a central angle of Au = zr After n hours, we are 
at the position u=up+n- Au. 

4. At the northernmost point z = rs sinu is maximal, thus, uo = 7/2. This 
point is reached after “2? hours. 

5. The “starting tangent” ¢ lies in the circle’s plane (> f 1 G) and is per- 
pendicular to the position vector of the starting point P (> f 1 p). So, we 
get t= Gx p. This East-West direction @ in the tangential plane of P equals 
the normal vector of the meridian circle’s plane AMN, where N(0/0/R) is 
the North Pole. The northern direction follows from 7 = @x w. Therefore, we 
have the departure angle ¢ with 


tei 
cos Ee = SS 
él a 


>>> Application: intersection of two flight routes (Fig. 7.91) 

An airplane flies along a great circle from A to B, and another airplane flies 
along a different great circle from C' to D at a constant altitude 10 km. Where 
can they potentially meet? 


Fig. 7.91 Center: two trajectories and their potential meeting. Right: 24 GPS 
satellites at an altitude of more than 20,000 km, evenly distributed into groups of 
four across six planes, each plane being inclined towards the equator plane by 55°. 
Through rotation about the Earth’s axis by 60°, the trajectories of the satellites 
overlap. 


Solution: 

Let M be the center of the Earth. Then, the trajectories are in the great circle 
planes €¢; = ABM and ¢2 = CDM. The potential intersection is, therefore, 
on the intersection of the two planes, at an altitude of 10km. 

We determine the normal vector of the planes: 


E,: ni = AM x BM, €9: Nng=CMx DM. 
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The direction vector 7 of the intersection is perpendicular to both normal 
vectors: 

=>lUlUc oO 

ran, xn. 
We normalize (79 = 7 /|7|) and scale in order to get the direction vector § 
of the intersection point: 


“s = +(6,370+10)km- 76. 


The relevant sign depends on the specific situation. <<< 


>>> Application: determining the position of an aircraft (Fig. 7.92) 
How can an aircraft determine its position by measuring the distances to 
three fixed points A, B, and C (e.g. by means of a laser or radar)? 


Solution: 

Let a, 6, and c denote the distances of the aircraft position P relative to the 
three fixed points A, B, and C. Then, P is contained in the intersection of 
the three spheres /4, “gp, and No centered at these points with radii a, b, 
and c. Any two spheres intersect each other along a circle kap, kac, and kpc 
(see Application p. 147). The carrier planes of two such intersections — for 
instance, €4B, €4c — intersect each other along a straight line s. This line 
is to be intersected with one of the three spheres (resulting in two possible 
positions of P, of which only “the upper” one is possible). <<< 


Fig. 7.92 positioning 


>>» Application: Global Positioning System — GPS (Fig. 7.93) 

The GPS has brought a revolution in the determination of locations. By 
means of handy devices, one can know his/her exact position at any time 
with an accuracy of up to a few meters (!). This is made possible by the fact 
that a relatively small number (24) of strategically distributed satellites (Fig. 
7.91, right) constantly transmits signals containing their current position and 
the corresponding exact time. How can the device determine its own position? 


Solution: 
In principle, the GPS operates as described in Application p. 363: The current 
positions A;, By, and C; of (at least) three satellites are obtained by radio. 
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For a first calculation, a position Q close to the actual position P is chosen 
(e.g., the last calculated position). 

This yields the distances aj = A,Q, b; = BiQ, and c, = CQ. Now, we 
intersect these three spheres centered at A;, Bi, and C (as in Application 
p. 363) and obtain a solution. However, the new positions Az, Bz, and C2 of 
the quickly moving satellites are assumed and the old distances are calculated 
as a result. This results in a “mixed position” Q 2. This, of course, does not 
agree with Q or P for P + Q. The distance d = QQ12 is a measure of the 
error: the smaller d, the closer the actual position P to Q. 

Now, we do the following: We repeatedly carry out the above-described 
“calculation using a sample” by systematically varying the coordinates of 
Q(dz/dy/qz) and observing the direction in which the distance d decreases: 
Then, for example, the values of d are obtained only by varying the z-value. 
Next, we can find a better g,-value through relatively coarse interpolation. 
Now, we change the q,-value and then the q,-value to get the result. In the 
meantime, we will have to include new satellite positions into the calculation 
in order to account for possible changes in position. 


® Remark: Optimizing the step-by-step (iterative) search for the solution is a very 
challenging task, but we will not discuss this here. ® << 


>>> Application: position determination in earlier times 

GPS is an invaluable asset, especially in shipping: One can almost immedia- 
tely get lost out in the ocean. How was the position determined in earlier 
times while travelling on the high seas? 


Fig. 7.94 Portuguese galley Fig. 7.95 The GPS fails — what now? 


Solution: 

The determination of the latitude y is relatively simple: On the northern 
hemisphere, we see the Pole Star under an inclination angle that equals the 
latitude y (on the southern hemisphere, we use the Southern Cross). It was 
also possible — using tables — to draw conclusions about y given the Sun’s 
highest position (the Vikings presumably did so). 

® Remark: It was much more difficult to determine the longitude out on the ocean 
(the well-known Greenwich Observatory in the east of the city of London has been 
internationally recognized since 1885 as lying on the zero meridian). In order to 
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determine the current longitude, it was necessary to have an absolutely precise clock 
on board, which was not technically feasible until the 19th century. The problem 
was indeed serious, and it was not uncommon for a captain to “lose his nerves” 
during an intermediate storm on a week-long voyage from the mainland to some 
remote island and to return on the right circle of latitude in the wrong direction 
because he believed he had missed his destination. ® <<< 


>>> Application: the “wrongly tilted” crescent Moon (Fig. 7.96) 

The following phenomenon is well-known and appears over and over again 
as an unanswered question in the literature and on internet forums: In most 
cases, the bisector of the crescent Moon does not seem to be precisely directed 
at the Sun. Particularly at sunset, when you would expect the bisector to be 
horizontal, it mostly points upwards. How large is this “deviation angle”? 


Fig. 7.96 The seemingly wrong tilt of the crescent Moon at sundown: The bisector 
of the terminator points upwards! 


We deal with close-ups of the Moon, which can only be produced by means 
of an efficient telephoto lens and deliver a good approximation of a normal 
projection where the theorem of the right angles (page 188) holds. 

The path of the light from the Sun to the Moon is straight in any case. The 
outline of the Moon’s sphere is circular. The terminator (a great circle on the 
Moon) appears as a half-ellipse. The center of this ellipse is the projection of 
the Moon’s center. The vertices of the image ellipse lie diametrically opposed 
on the sphere’s outline: The connecting line of these points in space has 
“principal position”. Therefore, it is parallel to the sensor plane. The minor 
axis is orthogonal to the connection of the two vertices, and according to the 
theorem of right angles, it coincides with the normal projection of the circle’s 
axis. 

Now, we want to derive a formula for the tilt of the minor axis or the “bulbous- 
ness” of the ellipse. Imagine that the lens center is the origin of a Cartesian 
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coordinate system. For the sake of simplicity, we define the optical axis as 
the x-axis, the horizontal direction of the sensor plane as the y-direction, and 
the line of steepest slope of the sensor plane as the z-direction (Fig. 7.96). 
The direction to the Moon is also given by m = (1,0,0). The direction to the 
Sun is defined by the direction vector § = (Sz, 5,,5z). Its normal projection 
onto the sensor plane has the components §,, = (0,s,,5z). We measure the 
angle y to the y-axis by normalizing §,, and multiplying by # = (0, 1,0): 


COS Y = Sn +¥ = Sy/\/ 52 + 82+ 82. (7.26) 


The coordinates of the Moon and those of the Sun (in horizontal polar coor- 
dinates given through their azimuth angles a, a* or their difference 6 = a-a* 
and elevation angles ¢ and «*) may be at our disposal (for example, by means 
of relevant software). Let us look at the Cartesian coordinates in a coordinate 
system which has the same y-axis, but whose z-axis is vertical: There, the 
direction vector to the Sun has the components 

8" = (8), 8,,8>) = (cose” cos, cose” sind, sine” ). (7.27) 
Relative to the coordinate system that is twisted through the elevation angle, 
the following applies: 


§ =(s, cose+ 8. sine, s,,—-s),siné + 8; cose). (7.28) 


y? 
If we again insert (7.27) in (7.28), y can be calculated directly by inserting 


: se — be, a 
Sy=SiNdCOSE’, Sz =—cosdcose siné+siné COse 


in (7.26). The tilt w of the sun rays to the picture plane is equivalent to the 
complementary angle to the negative z-axis m determined by 


cosy = —§-™m = —S8z/\ /s? +82 (with s, =cosdcose* cose + sine” sine). 


This cosine value is also a measure of the thickness of the image of the 
terminator. <<< 


8 Infinitesimal calculus 


From Chapter 6, we know that differentiation yields information about the 
instantaneous growth of a function. This growth may be interpreted geometri- 
cally as well as physically (inclination of the tangent, instantaneous velocity, 
instantaneous acceleration, etc.). In this chapter, we will discover additional 
fields of application, such as examples involving minima and maxima, and 
the progression of power series. 

Integral calculus can be described as the “inversion” of differential calculus. 
Broadly speaking, we get information about the area under a function graph 
by performing integration. This property may be interpreted in several ways. 
Geometrically speaking, it becomes possible to calculate areas, volumes, arc 
lengths, or a necessary amount of energy. Probabilities, life expectancies (and 
much more) likewise fall into the realm of integral calculus. The practical 
applications are so diverse that we can only scratch their surface. 

The integration of functions is often taxing to novice mathematicians. So- 
metimes it can even be impossible. In any case, integrals are now being 
increasingly evaluated through numerical means by computers. 
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8.1 Calculation with infinitesimal quantities 


Differential and integral calculus are not fundamentally different. Both are 
concerned with “infinitely small” (infinitesimal) quantities — in both cases, we 
speak of infinitesimal calculus. In general, it can be stated that the calculation 
steps for non-linear problems often require “infinitesimal deliberations”. This 
new method of thinking may require some getting used to, but is ultimately 
quite easy to learn. At first, the most essential part is to imagine the relations 
as “very small”, but not yet infinitesimally (or “infinitely”) so. Then, one may 
attempt a transition into the infinitesimal way of thinking. When analysed 
infinitesimally, the relations are often much easier to understand than the 
rather complicated result would lead us to believe. 


>>> Application: tapering of a jet of water (Fig. 8.1) 
When a jet of water exits a faucet, it produces a surface of revolution that 
tapers down towards its lower end. How is its meridian curve defined? 


Fig. 8.1 continuous and interrupted jet of water 


Solution: 


Let Qo be the cross-section at the height where the jet of water exits the 
tap. The exit velocity is v9. Over time dt the amount of water dV = Qo vp dt 
emerges from the tap. After t seconds, the emerging water is accelerated to 
the speed v = v9 + gt, and the cross-section Q = Q(t) equals 


On-Ort = OG 


VO +gt 


Considering an idealized circular cross-section, the following relation holds 
for the radius «x of the cross-section: 


_ 1 | Qovro 
a, vot gt 


8.1 Calculation with infinitesimal quantities 369 


The following is true for the distance passed by the water, see Formula (2.28): 


s(t) = vt + SP. 


Thus, the meridian curve is already given by the parameter representation. 


® Remark: In practice, the jet will keep changing its shape due to the non-uniform 
flow of water. When the intensity of the water stream is reduced, droplets eventually 
start to form (see Fig. 7.76). ® << 


We have already made a few infinitesimal deliberations. First, let us consider 
that we have interpreted the tangent at the point P of a curve as a straight 
line connecting P with a very close “adjacent point” Q. 

If a transition to the limit Q > P is performed, then the straight line PQ 
becomes exactly the tangent. This transition can be performed geometrically 
by differentiation resulting in the inclination of the tangent value. 

The first derivative of a function is a function measuring the change of its 
values. In general, a function may be differentiated an arbitrary number of 
times. The second derivative yields a measure of the change of the tangent 
inclination, and so on. In the following section, we will have a closer look at 
this. 

To encourage a more careful manner of thinking about infinitesimal relations, 
let us now cite the following famous paradox: 


>>> Application: Zeno’s paradox of the tortoise and Achilles 

A tortoise is moving at a constant speed of 1= (this number may be a little 
high for such a slow animal, but that is not relevant here), and it has an 
advantage of 10m. A fast runner — in Zeno’s example, it is the mythological 
figure of Achilles (~ 450 BC, Fig. 8.2) — moves 10 times faster. When does 
the Greek hero catch up with the slower reptile? 


Fig. 8.2 Achilles and the tortoise 


Solution: 
The following line of reasoning contains an “error”: 
When the runner has reached the starting position of the tortoise, it has 
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already moved on by 1m. Once this new position is reached by Achilles, the 
animal is already 0.1m farther. When he reaches once again the tortoise’s 
position, it has already moved away from the preceding position by 0.01 m — 
and so it continues. This step can be repeated an infinite number of times, 
with the tortoise always remaining in front. It would seem that Archilles can 
never catch up with the tortoise ... 

The error lies in the conclusion: This potentially infinite number of steps 
describes only a small interval of time: 1s+0.1s+0.01ls+...=1.1111...s. 
What happens after this interval is an altogether different question. 

Let us now turn to the “true” solution to the problem: 

For the trajectories of the runners, the following must hold true: sg = s;+10m. 
At the time of overtaking, the time T has elapsed. Then, 


(0 fF sme Ss See ia oe 
Ss Ss 


>>> Application: sum of fractions — the joy of proof 


Prove the formula 
1 1 1 1 1 


= t + tosis SS 
ir 4. 42° 43 3 


Fig. 8.3 proofs without words 


Solution: 

Instead of just using well-known formulas, we give two geometrical proofs:! 
According to Fig. 8.3 (left), the squares on the upper left- and lower right- 
hand are left as is, but the upper right-hand square is itself subdivided into 
four squares, and we repeat recursively the process begun with the initial 
square. Of the area that has been “processed” (three-fourths of the total), 
one-third has been colored. At the next step, we again color one-third of the 
area, and so on. The amount of the area that has been processed converges 
to 1, and so the colored area converges to one-third. 

The second proof (Fig. 8.3, right) is left to the reader. <<< 


1 Roger B. Nelsen: Proofs without Words: Exercises in Visual Thinking (Classroom Resource 
Materials). 
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8.2 Curve sketching 


Drawing graphs of ever more complex functions has become easy in the age 
of computers. It is often not common points of the curve that are of practical 
interest, but points with horizontal tangents, which are known as minima 
and maxima respectively. Occasionally, the so-called inflection points are also 
important. These are the points where a curve passes from a “right curve” to 
a “left curve” (and vice versa). 

We have the fundamental theorem: 


The local extremal values of a curve y = f(x) must satisfy the condition 
y'(x) =0. The inflection point must satisfy the condition y’’(a) = 0. 


Proof: The first part is trivial: At a minimum or maximum (that is, at an extreme 
value), the tangent of the curve has to be horizontal, that is, y’ = 0. 

Now, consider the part of the graph y(x) that is curved to the right. If we consider 
the tangents with increasing x, their slopes y’(x) decrease continuously. The increase 

U 

of y’ is, therefore, negative: ay =y" <0. Similarly, the preceding argument applies 
to “left curves” where y” > 0 holds. Therefore, the transition point (inflection point) 
has to satisfy y"” = 0. © 


® Remark: The above-mentioned conditions are necessary but not sufficient for the 
existence of a local extremum or point of inflection: 

If y’(xo) = 0 and y” (ao) = 0 at a specific place zo, then (a9/y(xo)) is a saddle point 
(provided that y/"(xo) #0). 

If both y(ap) = 0 and y'"(xo) = 0 at xo, then a higher-order point of inflection 
or a flat point is present (provided that either an even-ordered or an odd-ordered 
derivative is the first to be not equal to zero). 

In practice, however, you first draw the graph with a computer and then “optically” 
check such subtleties. ® 


>>> Application: Gaussian bell curve (Fig. 8.4) 
Consider the “normal distribution” curve introduced by Gauf 


a2, (8.1) 


Where does the curve have extreme values or inflection points? Calculate the 
slope of the “inflection tangents”. 

Solution: 

Since the argument x occurs only in an even power, we have y(x) = y(-2). 
Therefore, the curve is symmetric with respect to the y-axis. Like the expo- 
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Fig. 8.4 Gaufs’s bell curve 


nential function, it does not have a root but approaches the z-axis asympto- 
tically on both sides. 


We compute the first derivative and use the abbreviation c = which is 


1 
Jon’ 


a constant factor. According to the chain rule, we have 


Since y # 0, we have y/ = 0 (extreme value) only for x = 0. There is obviously 
a maximum. In order to calculate the points of inflection, we need the second 
derivative. The product rule yields 


y" =-(L-yta-y’) =-(y-2?-y) =(a7-1)y. 


As might be expected, y’(0) < 0, which confirms that a maximum is present 
at x = 0. Inflection points generally occur at y/’ = 0. Since y # 0 always holds, 


x? —1=0 gives 2, = #1. The corresponding y-values are both equal because 
1/2 


= . The inflection tangents 


of the symmetry of the curve: yy = ce~ 
27 € 


have the slope yf, = #(-2w Yw) = £Yw- 

® Remark: The bell curve or normal distribution curve has a fundamental signifi- 
cance in probability theory. We will meet it again when it comes to the calculation 
of areas. The inflection points of the curve have to do with the so-called “variance 
of the distribution”. @ <a 


>>> Application: maximal circle of curvature of a parabola 
What radius of curvature o@ does the basic parabola y = ax? have at the point 
x = 0? For this, we use the formula 


3 
1+y’7)2 
y= (hy) 


. (8.2) 
Solution: y" 
With y’ = 2az and y” = 2a and using Formula (8.2), we get 
3 
(1+ (2ax)?)’ ‘ 
o(x) = = 0(0) = : daa 
2a 2a 
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>>> Application: osculating circles of the sine curve 
Show that the zeros and the points of inflection of the sine curve 
y =a sin(ba +c) are identical. Calculate the minimal osculating circle. 


Solution: 

Let y! = ab cos(ba +c) and y” = -ab? sin(bx +c) = —b? y. The conditions for 
the zero (y = 0) and the point of inflection (y’/’ = 0) are, therefore, identical. 
If sin(bx2 +c) = 0, then cos(bx +c) = +1. Therefore, we have the tangent of 
inflection with the slope tab. A minimum wz = 2, occurs if y'(xzs) = 0 and 


y(x,) = +a. Therefore, it follows that y"(x,) = tab. 
With Formula (8.2), we get o(x,) = 


abe’ 


Nt 

e, 
WW e race 
Na 


Fig. 8.6... and in practice 


For the prototype of the sine curve y = sinz, we have a= b=1 and c= 0. 
The tangents are, then, inclined by 45°, and the maximal osculating circle 
has the radius 0(x;) = 1. Fig. 8.5 shows a general sine curve with a number 
of osculating circles. The position of the midpoints of the circles of curvature 
(light blue) is called the evolute. This curve has its points at infinity in the 
directions normal to the tangents of inflection. <<< 
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8.3 Optimization problems 


So far, we have determined the extremal values of a function y = f(x) by 
computing the zeros of its first derivative. This is a very common technique 
in applied mathematics. The most striking task in optimization problems is 
to find the target function f(x). The variables need not be called x and y. 
We will give some examples in order to illustrate how to deal with so-called 
“optimization problems”. 

>>> Application: cardboard box of maximal volume 

A rectangular cardboard sheet is to be formed in such a way that the paper 
box with the largest possible volume is produced. 


b 


Fig. 8.7 box made from a rectangle 


Solution: 

With the conditions of Fig. 8.7, the dimensions a and b of the sheet are given. 
We now cut small squares of side length x at the four corners. When we then 
fold and join the resulting rectangles, we have constructed a cardboard box 
with the dimensions (a—2x)-(b-22)-x. Its volume is V(a) = (a-2x)(b-2x)z, 
and it depends on x, which is to be determined such that V(x) attains a 
maximum. V(az) is our target function. The condition for a maximum is 
V'(x) = 0. We expand 


V(x) = (ab- 2ba - 2ax 4 4a”)x = 42° - 2(a + b)2? + abs 
and, therefore, 
V' (a) = 1227 - 4(a+b)x+ ab. 


In order to find the zeros of V'(x) = 0, we have to solve a quadratic equation: 


_ A(a tb) £ /16(a+ 6)? —4-12- (ab) 


x : 
1,2 24 


We divide by 4 to simplify the equation and get 


(a+b) +/(a+b)? Bab _ (a+b) + Va? —ab+b? 
6 6 , 


£1,2 = 
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Which of the two solutions is the right one? We can recognize the correct 
solution either from the computer drawing, or we find the second derivative 
— which is justifiable in the present situation: 


V"(x) = 242-4(a+Db). 


We substitute x1,2 and obtain 


+b) +Va?-—ab+ b? 
Vey) -04 ev! 4(a +b) = 44Va? —ab+ B. 


6 
Therefore, the minimum is V(2) since V"(21) > 0, while V" (x2) < 0 yields 
the maximum because V" (a2) < 0. <<< 


Fig. 8.8 light intensity 


>>> Application: optimal lighting (Fig. 8.8) 
How high does a light source have to be placed above a horizontal plane so 
that a certain point P at a fixed lateral distance a is as well illuminated by 
the source as possible? For the illumination intensity F as a function of the 
distance r, the angle of incidence a, and the (constant) intensity J of the 
light source (light intensity), we have the equation 

B= e cos a. 

? 

Solution: 
The so-called target function EF depends on two variables, namely r and 
a. However, the two variables are not independent of each other, because 


a, a 
— = sina, and consequently, r? = — z~- Therefore, we have 
r sin* @ 
a 
E(a) = sin? a cosa. 
a 


J 
The positive factor — only causes a scaling of the target function. As a 


a 
result, the simplified function 


E(q) = sin* a cosa 
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will have the same extreme values for a. For these extreme values, the follo- 
wing must apply: E” = 0. Of course, the prime ’ indicates the derivative with 
respect to the variable a. We apply the product rule and get 


dE (a) 


Ho)= 


= 2sinacosacosa + sin? a(—sina) = 0. 


We can assume sina # 0, and thus, a # 0. This is only possible if the light 
source is directly above P. Thus, cancelling sina is permitted, and we find 


2cos? a — sin? a = 0 > 2cos? a = sin? a > tan?a=2=> tana =+vV2. 


Therefore, for optimal illumination, the light source has to be at a height of 


| ees Te oe 
tana /2) 7 


>>» Application: beams with maximal load bearing capacity (Fig. 8.9) 

A beam of greatest bearing capacity W is to be cut out from a cylindrical 
tree trunk of a given diameter d. What is the ratio width : height (b: h) when 
the following function should achieve a maximum: 


W=cbh? (c>0 material constant)? 


Fig. 8.9 load bearing capaci- ‘Fig. 8.10... and in practice 
ty in theory ... 


Solution: 
W depends on 6 and h. On the other hand, b and h depend on each other by 
the relation b? + h? = d?. Thus, 


W (b) = cb(d? - b*) = c(d?b- 6°). 
W(b) is maximal (or minimal) if the derivative with respect to b vanishes: 


dW 2 2) _ 
ib =c(d° —3b°) =0. 
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Therefore, one obtains } and, as a further consequence, h from 


b= (2) hs M2 a bshsd= 1 V2: VI 


® Remark: Fig. 8.10 shows that beams with this cross-section are, in fact, used in 
practice. The reasons behind the collapse of a building (such as the former Reichs- 
briicke in Vienna, which cracked in 1976) depends on a variety of factors. Naturally, 
one reason is that the load bearing capacity increases only with the square of the 
scale, while the dead weight of objects increases by the power of 3. This means that 
the load-bearing beams of a model of a bridge can hold significantly more than the 
beams of the real bridge (see Chapter 3). ® <<< 


>>> Application: optimal roof inclination 

How large should the inclination angle a of a roof’s surface be for the rain- 
water to flow off as quickly as possible? 

Solution: 

The time t required for rainwater to run down shall be given as a function of 
the inclination angle a. Let Ax be a small piece in the horizontal direction. 
The corresponding rafter length is, then, 


Ax 


As= : 
cosa 


The height difference is Ah = Ax tana. The increase in speed, i.e. the run-off 
acceleration, is thus 


Av= \/2g Ah = \/29 tana Ag. 
The increase of elapsed time At when running off by As equals 


Ax 
At = As _ Cos @ 


Av \/2g tana Ax 


Thus, the square of the increase of elapsed time reads 


(Ax)? 


(At)? = cosza _ ol Ag =! Ax _1 Ag 


2gtanaAz 2gcos*atana g2sinacosa gsin2a 


If the increase of time should be minimal, then so should its square. However, 
the reciprocal of this is then maximal. Thus, we can say, At becomes minimal 
if the expression sin 2a becomes maximal, thus 


+ sin2a = 0. 2cos2a 0 > 2a = 90° > a = 45°. 
a 
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® Remark: With an inclination angle > 45°, the run-off path that is required to 
overcome the horizontal distance Ax already increases to such an extent that more 
time is required for draining the rainwater. Therefore, the angle 45° represents the 
best “compromise” between the brevity of the run-off path Az and the acceleration 
caused by the slope. ® <<< 


>>» Application: Snell’s law of refraction (Fig. 8.11) 

We consider two optical media J and JJ with different optical densities in 
which light propagates at respective speeds c; and cg. The two media are 
separated by a single layer. Experiments have given consistent results and 
confirmed that the law of refraction comes into effect at the transition point 
(kink). Let the respective angles a and be enclosed by the incoming and 
the outgoing ray and the layer’s normal. Then, Snell’s law reads 


sina Cc] 


sinB C2 
Show that this law, named after Snell, is based on Fermat’s principle, accor- 
ding to which light “looks for’ that path from A to B where it reaches B in 
the shortest possible time. 


Fig. 8.11 total time function during refraction and imaging 


Solution: 

The problem can be considered in the plane ¢, which contains A and B and 
is perpendicular to the planar interface 7. We identify the intersection line 
enn with the z-axis and assign the coordinates A(0/y1) and B(d/y2) to the 
two points. The kink shall have the coordinates R(a/0). The following holds: 
time = distance / speed. Therefore, the “total time function” has the form 


LAR FB Vie Ta 
Cl (o>) Cy c2 , 


t(x) 


If the total time is minimal, then t’(a) = 0 has to hold: 


oe 1 Qu | 1 2(d-2x)(-1) 


C1 2Q/yi+a2 ©22 ye+(d—a)2 
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or, equivalently, 
1 1d- 
PGs 22 25. ep), 
Cc AR c. RB 


d- 
With — = sina and = = sin 8, we simplify the equation to 
R RB 


1. : 
— sina - — sin =0. 
C1 c2 


From this, we find the law of refraction 


sin @ _ C1 

sinB cy’ 
® Remark: If we consider all the rays which pass through B, and apply the law of 
refraction to them, the outgoing rays in their extension to the side of B form a 
curve called the diacaustic (Fig. 8.11, right). An observer at A can now only guess 
the position of B and assume it to be at the contact point with the diacaustic B*. 
The vertical bar through B appears strangely “uplifted”. ® <<< 


>>> Application: the most voluminous conical cocktail glass (Fig. 8.12) 
At which opening angle 2a does the conical cocktail glass have the largest 
volume with a given slant height s (cone)? 


Fig. 8.12 conical cocktail glass Fig. 8.13 optimization 


Solution: 
The volume of the cone is 


1 1 . 1 ‘ 
V==ar'h= 37 (s sina)? s cosa = rae sin” a cosa. 


2 


It becomes maximal if the expression V(q) = sin* a cosa becomes maximal, 


3 = C= 
ie. if —V =0: 
da 
2sinacosacosa + sin” a(-sina) =0=> 2sina cos a = sinasin” a. 


The case sina = 0 > a = 0 leads to a minimum, so that we can reduce by 
sina # 0 and obtain 
2 


a => tan?a=2=> tana=V/2>a#55°. 


2cos? a = sin 


Thus, the opening angle of the cone is 2a = 110°. <<< 
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>>> Application: maximum trajectory length on a hillside (Fig. 8.14) 
What is the angle a at which a projectile has to be launched to get as far as 
possible in a terrain with inclination angle ¢? 


e>0 


Fig. 8.14 trajectory on a hillside 


Solution: 
We have often worked with the ballistic trajectory (parabola): 


(1) w=votcosa, (2) y=votsina- st 
Now, we intersect it with the inclined terrain 
(3) y=axtane=kz. 


From (1) and (3), we conclude 
y=kx=kuotcosa=> ytana=kvotsina. 
Substituting this into (2) yields 


_ytana g y? 
Ok 2 k?v2 costa 
We may reduce by y if we exclude the trivial solution y = 0: 
tana 2 sin a 
gd, = = l>ye= Kvpcos*a( -1) 
2. Kets COs? a k g kcosa 
2 
=> y = —kup(sinacosa— kcos’ a). 
g 
The value y is an extremum if the expression 
: sin 2a 
f(a) = sinacosa - kcos” a = — kos” a 


is an extremum, which happens for f’(@) = 0: 
1 
f'(a) = cos 2a + 2k cos asin a = cos 2a + ksin 2a = 0 > tan 2a = = 


Numerical example: 
é = —45° (on a downward terrain): 


k=tane=-1=> tan2a=1 => 2a = 45° > a= 22.5°, 


€ = 45° (on an upward terrain): 


k = tane =1=> tan 2a =-1 => 2a = 135° (!) > a = 67.5°. — 
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8.4 Series expansion 


Up until a few decades ago calculation with trigonometric functions, loga- 
rithms, and exponential functions was a very tedious task. If one wanted to 
work with precision, one had to constantly refer to tables and then interpo- 
late between the table values. In applied mathematics, one was satisfied — 
justifiably — with less precision and worked with the “slide rule”. 

With the advent of electronic pocket calculators, sine values or logarithms 
appeared on the “flashy” display (in less than a second). Today, an ordinary 
PC needs only (a fraction of) microseconds to accomplish the same task. 
How is this possible? 

The solution to the problem is that non-algebraic (transcendental) functions 
can be approximated by algebraic power functions which are called power 
series. For the latter functions, one “only” needs to be able to multiply and 
divide if one knows the derivatives of the functions. 

With certain limitations, Taylor’s fundamental formula holds: 


2 CK) (op 
f@)=> f°) (g oe)". (8.3) 


ko 


The value 29 is called the knot. The number k! = k-(k-1)-(k-2)-...-3-2-1 
is called the k factorial. It proves to be convenient to define 0! = 1 
The prerequisite for this series is that |” - v9| has to be “small enough”. In 
many cases, |x — Z| < 1 is sufficient. 
Proof: Written in detail, we obtain 7 

erent Z 0) (oxide is (0) (eae)? i 0) 
We make the ansatz 

f(x) = ag + a4 (a — 9) + ao(ax — 29)? + ag(a - 29)? 
and attempt to determine the coefficients a,. This is done by repeated differentia- 
tion: 

f'(z) = 1-4, +2-ao(a-29) + 3-a3(%- 29)? +4-a4(x2- 29)? +..., 

f"(w) =2-1-a24+3-2-a3(%—2%9)+4-3-a4(a—29)*+..., 

f'" (a) =3-2-1-a3+4-3-2-a4(x— 20) +5-4-3-a5(a- 20)? +.... 
Assuming x = Zp in all these equations, we obtain 


f(a) = a9 > ao = f (20) 0) (since 0! = 1), 


f'(@o) =1-a1 > ay = f' (a0) a to) (since 1! = 1) 
f"(@o) . 


2! 
m 
f'’ (xo) = 9-2-1+ag > 43 = ——— f iM) 


(x—a)? +.... 


f" (a0) = 2-1- a2 > ag = 
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(k) 
f09(¢9) = he (R= 1) 00-32-10 > ay = EO), 0 


For the knot xo = 0, the formula was already given before Taylor by Maclau- 
rin: 


f(x) = f(0) 4 fr), fo) 2 fo) .3 Peeed (8.4) 


Again |x — x9| has to be small enough. Formula (8.4) allows us to calculate 
most of the basic transcendental functions. 


>>> Application: series expansion of the exponential function 
Show that the exponential function is given by 


se og gt 
€ SLES Fy aye ay tie (8.5) 


Solution: 
We simply “prove” this by direct calculation: The Euler function can be 
differentiated as often as desired and does not change: 


Clee =.s8" Gee. 
Therefore, e‘”) (0) = e° = 1, and thus we have already verified Formula (8.5). ««< 


It is barely any more difficult to calculate the series expansion of the sine or 
cosine function: 


11 
a? 
"or 10 


3 5 
ee / : 
fio) aie | 


Fig. 8.15 approximation of the sine function by Taylor series 


>>» Application: series expansion of the sine function (Fig. 8.15) 


Show that a ee ae ro 
sin = Be te ay tees (8.6) 

2 ,4 26 
Ce a eee sale ee (8.7) 


2! 4! 6! 
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Proof: We have (sinx)' = cos x, (sinx)” =-sin(x), (sina)’” =-—cosax. The fourth 
derivative is again the function itself ((sinz)) = sina). So, the derivatives repeat 
themselves periodically. Therefore, sin0 = 0, sin’0 = 1, sin’ 0 =0, sin’”0 = —-1, and 
thus we have the series for the sine function in Formula (8.6). The development of 
the cosine function is done analogously. © <<< 


>>> Application: useful approximations of trigonometric functions 
Show the following frequently used approximations “for small x” and test the 
quality of the approximation for different x < 0.5: 


sing » tana» \/2(1-cosxz) ® a. 


y \ y=tan 


y=-y201 -COS X 


Fig. 8.16 four functions which differ little in the neighbourhood of x = 0 


deg. | rad. x | error [%] | error [%] error [%]| for 
180 x for sing | for tanz | \/2(1-cosz) 
5 0.09 0.1 0.3 0.03 

10 0.17 0.5 1.0 0.13 

15 0.26 1.1 2.4 0.29 

20 0.35 2.0 4.3 0.59 

25 0.44 3.1 6.9 0.71 

30 0.52 4.5 10.3 1.14 

fia 
Proof: The approximation of sinx » x follows directly from sin x = x# - a +. 


x+2°, because x° is already very small for small 2. Further, 


x x 
tan x = 5 XL. 
cos x ue 
-—+. 
2! 
Finally, 
V 2(1-cosx) = \ (1-5. 
PAI 
For negative x, the negative expression — pa —cos2z) is to be used. © 


® Remark: From the table, one can see: The error made by the approximation of 
the sine function for x° < 20° and of tangent function for x° < 15° is less than 


384 Chapter 8: Infinitesimal calculus 


2%. The best approximation is given by \/2(1-—cosx), where the error is only 
worth mentioning for 7 > 30° (which can also be seen in Fig. 8.16)! We need the 
given approximations, for example, in order to derive the oscillation duration of a 
pendulum (Application p. 420). @ <<< 


We need Taylor’s Formula (8.3) for the series expansion of the logarithm: 


>>» Application: series expansion for lnx (Fig. 8.18) 
Show that the natural logarithm can be approximated by 
(e-1)?  (@-1)3_ (@-1)4 | 

2 T 3 4 Tee ee 


Inz = (a-1) (8.8) 


Solution: 
The derivatives of the function f(«) = nz are f!= 42 a7', f" = (-l1)x?, 
fa(-1)-2)e?, £9 = C1) 2)(-3)8 cer. 
Since we cannot divide by 0, we expand at the knot xo = 1. There we have: 
fil) =0, FU) 1, F =), 
f"(1) = (-I)-2), FO (1) = (-1)(-2)(-3), «--. 


Obviously, the derivatives at the knots are 


Pe) ep 
kl k 


and Formula (8.8) is verified. <<< 


The number of terms to which the series must be expanded in order for the 
result to be accurate enough depends on the respective series and also on how 
much the function value x differs from the knot xg. Usually, the expansions 
are stopped somewhere between the 11th and 15th power. 


>>> Application: approximation formula for body extension 

(Application p. 386) 
If the temperature of a body of length Ly, is increased by At, its length 
increases to L2 according to the formula 


Ig= 1, v/1+7At. 


There, y is the material constant. Show that in the case of a small expansion 
in length, one can do well with the approximation formula 


Ig = 1, (1+2at). 


Solution: 
We set x = y At and expand the function 


f(x)=Vl+a 
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in a power series at the knot 0: 


1 1 2 
f(x) =(1+2)3 = f'(a) = A +2) 3 = f" (2) = - (-5) (1+2)°3 , ete. 
At the knot 2 = 0, we have (1 +2)” = 1 for each exponent u, and thus, we 
obtain 


For small x, x?/9 is still much smaller (e.g., 2 = 0.001 > x?/9 x 0.0000001) 
and can be neglected, because length measurement is limited in any case. «<< 


>>> Application: conversion of fisheye photographs 

If you want to photograph as much of the interior of a room as possible, 
you have to use a wide-angle lens. The spiral staircase in Fig. 8.17 could 
only be captured in its entirety with a so-called fisheye lens. Such images can 
be transformed by means of a power series expansion into ultra wide-angle 
images (Fig. 8.17, right). 


Fig. 8.17 left: fisheye image, right: wide angle view 


Solution: 

An ordinary photograph is a “perspective”, which is a projection from the 
center of the lens system onto the photosensitive layer. This mapping is “line 
preserving”: Lines in space are mapped to lines in the image plane. Let P° 
be the perspective image of a point P in space. Its distance from the center 
of the image (the principal point of the perspective) equals r. 

Let us distort a perspective image so that all points are stretched or com- 
pressed from the center by a scaling factor x = f(r) that is dependent on the 
distance r. This scaling function f(r) can be any continuous function. 
Fisheye images actually arise as follows: Image points far from the principal 
point are “shifted” towards the center. For a given fisheye image (a is known), 
“only” the inverse function g(a) = f~'(x) that shifts back the points has to 
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be found. Irrespective of what the function g(a) may look like, it has a power 
series expansion at the knot x = 0 of the form g(x) = P25 ax a*. Since the 
distortion is close to zero near the image center, we have g(x) = x for a small 
x. Therefore, aj = 0 and a, = 1. The series expansion, thus, has the form 


g(a) * x4 a2” | a3x° | a4 x" ds x ee 


In the computer age, the coefficients can be found through interactive testing. 
Thereby, one must try to straighten curved images of rectilinear edges (Fig. 
8.17, right). Once a fisheye lens has been calibrated, each additional image 
is quickly rectified. <<< 


>>> Application: convergence problems of the sine function 

Up until this point, we have assumed the convergence condition |a — xo| as 
being “small enough”. It is convenient to take |a — | < 1, because (x — a9)* 
goes towards zero for large k. What does the computer do when, for example, 
sin 7.3 is to be calculated? 


Solution: 

First of all, arbitrary multiples of 27 can be added or subtracted to x without 
changing the sine value. Thus, sin 7.3 = sin(7.3- 277) = sin 1.01681. This value 
does not satisfy the condition |x| < 1 either. However, we can use the formula 
sinx = cos(z/2- x). Therefore, sin 1.01681 = cos0.55398 where the cosine 
series converges. 

The method initially described appears to be quite complicated. That is why 
it took such a long time for the sine value to become available at the “touch 
of a button”: It requires the execution of a computer program covering all 
possible cases. This program is now usually “hard-wired in a chip”. That is 
to say, it is implemented into the hardware and thus works quite fast. «<< 


>>> Application: convergence problems of the e-function or the |n-function 
Calculate e~?*4 or In 16.98 through series expansion. 


Solution: 


1 
We can restrict the domain of x to positive values because e~” = = Yet, 2.34 


does not fit into the desired range of x < 1 either (due to rapid convergence). 
If, however, e = 2.71828... is stored, then e?"*4 = e- e- e934, and for x = 0.34, 
the series expansion converges very well. 

For the calculation of Inz, the convergence interval is to be found in the 
neighbourhood of 1 (Fig. 8.18). Numbers outside the interval [1,e] will be 
divided by e or multiplied by e, until the number lies in the desired interval. 
Each division/multiplication decreases/increases the logarithm by 1. If the 
number is less than \/e = 1.64872..., then we can start the series expansion. 
Otherwise, we divide by \/e, decreasing the logarithm by 0.5, and then, we 
expand. In the specific case, 16.98 can be divided twice by e and once by \/e. 
The logarithm of the quotient 1.3938... can then be calculated (0.33204. ..). 
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y=In x 


(1/0) 


Fig. 8.18 y =In2, polynomial of deg. 6 Fig. 8.19 osculating parabola of deg. 2 


We have to add 2.5 to this result. Fig. 8.18 shows the poor approximation of 
the Taylor function if x is too far from 1. In the case of the logarithm, even a 
development to very high powers results only in a small improvement. <<< 


>>> Application: osculating parabola (Fig. 8.19) 

The first three terms of the Taylor series indicate the equation of an “oscu- 
lating parabola” of the function at the knot. Such a parabola approxima- 
tes the function graph there very well. Find such an osculating parabola of 
f(x) = cos 2x at the point zo = 0. 

Solution: 

The series expansion of f(a) is given by Formula (8.6) 


(20)? (2a)* 
2°! 


f(a)=1 


So, a9 = 1, a, = 0, and ag = -2. The osculating parabola has the equation 
y=1-2 ax. 
Fig. 8.19 illustrates how well the parabola approximates the curve. <<< 


>>> Application: three types of surface points (Fig. 8.20) 

One can extend the considerations of Application p. 387 to three-dimensional 
space. We consider an arbitrary point P of an arbitrary doubly curved surface 
and impose a Cartesian coordinate system such that P is the origin and the 
surface normal at P is the z-axis. Each planar intersection of the surface 
through the z-axis yields a (planar) curve which has an osculating parabola at 
P. It is possible to show that all these osculating parabolas form a (hyperbolic 
or elliptic) paraboloid or a parabolic cylinder (Fig. 8.20). Consequently, one 
speaks of an elliptic, hyperbolic, or parabolic surface point P. 

During the investigation of even the most complicated surfaces, it is now 
possible to relate to these three types of surfaces, and no further cases need 
to be distinguished. 

In particular, the following important theorem holds: 
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Fig. 8.20 elliptic, hyperbolic, and parabolic point on a surface 


A surface is developable if, and only if, it consists of parabolic points exclusively. 
A necessary (but by no means sufficient) condition for the developability is 
that the surface consists of straight lines. Visually, a developable surface can 
be recognized by the fact that all parts of the contour are rectilinear in any 
viewing direction. 


This shows that doubly curved surfaces such as a sphere, a torus, a hyper- 
boloid etc. cannot be developable. Arbitrary cones, cylinders, and tangent 
surfaces of space curves (frequently called developables, rarely called torses) 
can be developed into a plane. 


Fig. 8.21 The oloid belongs to the class of developable surfaces. All of its generators 
are of equal length. Its body can skillfully be turned over by means of two modified 
universal joints which is exploited technologically (for example, for the efficient 
mixing of pond systems). 


® Remark: If a surface is developable, then there exist cones or cylinders touching 
the surface along each generator (straight line). Consequently, the contour of the 
surface consists only of parts of straight lines. This is a very good distinctive feature 
of a developable surface. Fig. 8.21 shows the rotation of an oloid (wobbler) by means 
of two Cardan joints. In each situation, the object has rectilinear contours. It can 
be developed. See the wobbler.exe demo program. ® <<< 
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8.5 Integration as the inverse of differentiation 


We can already differentiate functions. The question arises if a prescribed 
derivative f’(x) can already determine the function f(z). 

The answer is: In principle, yes. If we have the derivative f’(2) for every 
z-value, then we also know the tangent direction. If a point Po of the curve 
with the x-value xp is known, then we go a little bit along the tangent to 
a new point P, with the z-value 7; = xp + Ax. There, we again know the 
tangent direction and arrive in an analogous way at a neighboring point P» 
with the z-value x2 = 2, + Az, etc. In this way, we obtain an “approximating 
polygon” which will approximate the function graph of f(x) more accurately 
for a smaller step size Ax. The polygonal curve converges to the function 
graph f(x) for Ax > 0. 

A small problem still remains: We must know the starting point Pp, i.e. the 
function value of f(29). Do we really have to know the starting point P5? 
If we choose an arbitrary point with the xz-value xo as the starting point, 
we still get a function whose derivative is always f’(2) according to our 
construction. This curve differs only by parallel displacement along the y- 
axis from the original solution. 

In the following, we call a function G(x) the anti-derivative of the functi- 
on g(x) if G(x) = g(x). Clearly, f(x) is an anti-derivative of f’(2). Then, 
the following theorem is valid: 


Each function has infinitely many anti-derivatives which can be made to agree 
up to an additive constant through parallel displacement. The graphs of all 
anti-derivatives are, thus, congruent to one prototype. 


The construction described above is called integrating the function g(x) and 
the anti-derivative G(x) is the indefinite integral of the function (indefinite, 
because parallel shifts along the y-axis are possible). Symbolically, we write 


G(a)= f 9(x) ae. 


In particular, the following holds 
f(a)= f f(a) de. 


>>> Application: graphical integration 

The Coradi integraph (Fig. 8.22) draws an anti-derivative for a given input 
function. The graph of the function to be integrated (the upper curve) has 
to be traced with a stylus. 

The angle between the small cutting wheel and the horizontal axis is adjusted 
by a complicated linkage proportional to the function value (point on the 
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Fig. 8.22 Coradi integraph (right: enlarged) 


curve). As a result, the lower curve is drawn by a pencil. At each point of this 
curve, the function value of the input curve is the slope of the tangent. «<<< 


Differentiation and integration are somehow inverse operations, such as mul- 
tiplication and division, squaring and finding the square root, evaluating sine 
and arcsine, or e* and Ina. 

Thus, we can already prove the following important rule: 


n+1 


| eas errs (n€R, n#-1). (8.9) 
ntl 


Here, C’€ R is called the constant of integration. 


Proof: We only need to show that if the right-hand side is differentiated, the 
left-hand side of the equation emerges. In fact, according to the already known 
differentiation rules, we have 


n+1 7 ’ 
1 1 
x LO = em) = ——.-(n+1)-a™b l= 2". 
n+1 n+1 n+1 © 
In Formula (8.9), we have to exclude n = —1, because we cannot divide by 


1 
zero. We also know the result of [ «7! dx = { — dz: 
x 


[ dese ee. (8.10) 
xv 


Taking the absolute value |x| of x is necessary, because the logarithm of a 
negative number is not defined within real numbers. 
Proof: We distinguish two cases and differentiate the right-hand side again: 


z>Os|el= x: (nvz+C/=-, 
x 


u<O=|a)=-x: (In(-x)+C)/= : = © 


—2£ 4 0 


Furthermore, the following can be easily proven by differentiating: 
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J sina de =-cose +C, jf cosade=sine+C, if faz-e+0. (8.11) 


In practice, the following theorem in which G(x) is the anti-derivative of 
g(x), ie. G'(x) = g(x), is often used: 


[ sax) dx = ~G(ax+b)+C. (8.12) 


Proof: We apply the chain rule 
1 ri 

( G(aa +b) +4 c) =o: 
a a 

>>> Application: Calculate the following anti-derivatives: 


z+ 
f sin(ax +0) de, [ Qx-3)% de, fer de, ae 
1-32 


G'(ax+b)a=G'(ar+b) =g(ax+b). © 


Solution: 
Using Formula (8.11) and Formula (8.12), we get 


1 
Fs sin(ax + b) dx = —(-cos(ax+b))+C; 
a 
using Formula (8.9) and Formula (8.12), we get 


_9y7 
f 20-3) dx = OP“ g, 
2 7 


using Formula (8.11) and Formula (8.12), we get 


atl] wed w+ 
e3 dzr=7:e 3 +C=3e 3 +C; 
3 
using Formula (8.10) and Formula (8.12), we get 
d i 
~_ = —In|1-32|+C. 
1l-3x -3 


® Remark: Of course, the notation of the variables does not matter. It is also correct 


to write { sinada =-cosa+C or f dt=t+C, etc. ® <<< 


More generally, the following formula is valid: 


[ g{h(x)] A'(x) dx = G[A(x)] + C. (8.13) 


Proof: Analogously we use the chain rule. © 
Thus, in the case of nested functions, the derivative of the inner function has 
to show up “somehow”. 


>>> Application: Calculate the following antiderivatives: 


J sin? ccosedx, [ 2xe ar, f= inxde, f (32? -1)% dr. 
x 
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Solution: 

The following solutions shall be checked by means of differentiation. 

[ sin? xcosx dz = f[[sinz]? cosx dz =? 

We set g(x) = x? and h(x) = sin. Then, h’(x) = cosa. Thus, the derivative 
of the inner function is already there, and we can write 


sin? x 


3 


J (sin a] cos dx = +C. 


f Qre® dx = e* +C, since 2x is the derivative of 2. 
3 1 (Ina)? ae a 
f-Inedr =3- f —Inxdz = 3~—~ +C, since = is the derivative of Inz. 
x x 2 is 


1 1 (347-1)? 
f x(3x? - 1)® dx = gf 6a(3a* =1)* des Gray 


derivative of 32”. «<a 


+ C, since 6x is the 


If the derivative of the inner function does not occur, one must try to rewrite 
the integral with all possible tricks. We will learn about this in the next 
section. Sometimes even a clever transformation of the integral does not lead 
to success. Thus, for example, the integral which is significant in probability 


theory 
7 e dx 


can only be evaluated numerically, to the chagrin of the user. 

Without admitting subtleties of the integral calculus, we will consider some 
important applications in the next section. For this, we need the fundamental 
fact that one can determine surfaces with antiderivatives. 

To conclude this section, let us look at a more practical example: 


>>> Application: optimal lawn sprinkler 
A lawn sprinkler moves in such a way that it evenly irrigates a rectangular 
strip of lawn. What does this motion look like? 


Fig. 8.23 normal and optimized lawn sprinkler 


Solution: 
At any moment, water leaves the lawn sprinkler at point A with the initial 
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speed vo and moves on a parabolic trajectory until it hits the lawn. The 
“throwing distance” w is calculated in Application p. 142 as 


1 
w(a) = —vasin 2a. 
g 


Fig. 8.24 lawn-sprinkling variations and the corresponding results of a simulation 


If the angle a@ is continuously changed with the time t 
a=a(t), 


the impact point of the water B travels back and forth. Its distance from the 
point A is, then, w(a(t)). At a = 45°, the water reaches the maximum distan- 
ce. We obtain the instantaneous “initial velocity” vg of B by differentiating 
with respect to time: 


dw da 2 d 
UB = ° = (chain rule) = * : 7A = 7 Nocos 2a a 


For optimal irrigation, vg has to be constant (co is constant) such that 


da 1 
dt cg cos2a’ 


This equation is called a differential equation. If we want to solve it, we have 
to separate the variables. Then, we obtain 


dt = co cos 2ada. 


Now, we can integrate both sides 
a dt = [ co cos 2a da 


t= al sin 2a. 
2 


and obtain 


From this, a(t) can be calculated: 


1 _ 2t it ; 
q@ = —arcsin = —arcsinkt. 
2 Co 2 
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0% 50% 100% ft 


Fig. 8.25 optimized function a(t) for the lawn sprinkler 


The solution is by no means trivial. Since the arcsin is defined only for 
arguments with value < 1, one has to use the formula 


1 
oe — arcsin(kt - 2) 
2 2 


for kt > 1. Considering the graph of the described function (Fig. 8.25), we see 
that, in practice, the lawn sprinkler should be realized either only for angles 
up to 40° or for angles over 50°: If a = 45°, the motion is too jerky due to the 
vertical tangent. In the restricted intervals, we can then limit ourselves to a 
linear growth of a with increasing time. In any case, the — quite common — 
lawn sprinklers with a “harmonic” sway are not optimal, as one recognizes 
from the illustration on the left of Fig. 8.23. <<< 
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8.6 Interpretations of the definite integrals 


The following theorem is of great importance to us: 


Let G(x) be the antiderivative of g(x) so that G’(x) = g(x). Then, the definite 
integral of the function g(x) 


[ aie a(a)), = G(b) - Ga) (8.14) 


gives the measure of the area underneath the curve g(x) in the interval [a, b]. 


® Remark: Usually, we are not so nit-picky when using certain expressions. One 
could simply say area instead of the measure of the area. We do not want to fix cer- 
tain physical dimensions. If the abscissa, for instance, is considered as the attractive 
force and the ordinate is considered as the distance (Application p. 403), then the 
measure of the area is the work (or potential energy) performed in the time interval 
[a,b], since work = force x distance. ® 

Proof: We decompose the area between the x-axis and the curve g(x) into small 
rectangular strips of width dx. The area starts at zero and increases by the value 
g(a) -dx with each rectangular strip. 

Now, consider the function G(x). It can have any value G(a) at the point a and 
increases by a very small dx in the same way as its tangent does, hence by G’(x) - 
dx = g(a): dx. At b, it reaches the value G(b) and is, thus, increased by the value 
G(b) - G(a). 

The increase of the area and of the antiderivative are identical. Therefore, G(b) - 
G(a) gives the total area. © 


® Remark: With a variable upper boundary x, we have 


Fi g(u) du = G(x) — G(a) = G(z) +C, 
and so, we obtain the relation between the definite and indefinite integral. ® 


>>> Application: area underneath the sine curve 
Compute the area that is enclosed by a hunch of the sine curve and the 
@-axis. 


Fig. 8.26 area underneath the sine curve 
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Solution: 
Two adjacent zeros of the sine curve are a = 0 and b= 7. In between, the sine 
curve is positive; hence, the area A is also positive, and we have 


A= ti sin x dx = ~ C082] = — cos m — (-cos0) = -(-1) - (-1) =2. 


In the final step, a series of minus signs appeared and it is easy to make 
mistakes. In such a case, one should use the immediately obvious formula 


b a 
-G(a)| = +G(a)|.. 


Here, we have 
T 0 
- cos | = +cosa| = cos 0 - cosa = 1- (-1) =2. 
0 


TT 
The definite integral “responds” to a sign change and then returns a negative 
area. In fact, 


20 20 0 
[ sinx dx = ~ cosa = + cosa] =cosQ-cos27=1-1=0 
0 0 20 
has not calculated the total area in the usual sense, but in the best case, one 
has proven the trivial fact that the two signed surfaces “cancel” each other 
(Fig. 8.29). << 


>>> Application: area of the circle and the ellipse (Fig. 8.27, Fig. 8.28) 
Calculate the formulas for the areas of the circle and the ellipse by means of 
integration. 


Solution: 

Today, everyone knows the formula for the area of a circle. The derivation of 
this formula requires integral calculus. 

A circle around the origin with radius r is given according to Pythagoras by 
the implicit equation x? + y? = r?. Explicitly, y can only be defined ambi- 
guously by y =+V r2 -— 2. Therefore, we have to exclude the lower semicircle 
and calculate the area under the upper semicircle. This means 


+r es, 
-f Vr? — #2 de. (8.15) 


For a “practiced integrator”, this integral may be easily computed. However, 
it requires a lot of computation and the solution is by no means visible to 
the “untrained eye”. 

Thus, we propose a different approach: We divide the circle into “infinitely 
many” sectors with the opening angle dy. The area dA of such an “elementary 
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sector” is equal to the area of an isosceles triangle with height r and base 


line rdy 
2 


r 
dA = — dy. 
a 


Now, we have 


Qn Qn p2 2 Qn 2 eee 
A= f aA= f[ r dy =" [ dp=—y =r’, 
0 0 2 2 Jo 2 'o 


Hence, the formula for the area of the circle is derived. We also know the 
result of the integral (8.15): 


ae 
fi Vr? — x? dx = (8.16) 


a 
Fig. 8.27 area of the circle ... Fig. 8.28... and the ellipse 


® Remark: In a purely heuristic way — even without integral calculus — we get an 
even simpler result: We consider the circle divided into n equally large sectors (Fig. 
8.27). Now, we put these sectors together so that the radii coincide and the arcs 
are alternatingly facing up and down as shown in Fig. 8.27. For n > oo, a rectangle 
with side lengths r and wr (half of the circumference) and area 7 r? is formed. © 
An ellipse “in principal position” with major axis length 2a and minor axis 
length 2b (Fig. 8.28) is described by the implicit equation 


i.e. explicitly by 


b --Z 
yY=rt— a? = 2. 


a 


Using Formula (8.16), we immediately get the area of the ellipse 


A +a 
—= nt ey 
a 


2 -a 


alo 


ca = A=abr. (8.17) 
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Fig. 8.29 signed area Fig. 8.30 average values 


If we know the area under a function graph f(x), we can calculate the average 
function value f. We have 


1 


See es b 
f= F(@) = —— 7 fle) de. (8.18) 


Proof: We will give a quite heuristic (and, therefore, hopefully more intelligible) 
proof: Let us imagine a thin-walled aquarium with many narrow vertical lamellas 
(Fig. 8.30). Now we fill water into each lamella until it is as high as the function 
graph on the glass disc. The amount of water filled in can be interpreted as a 
measure of the area ti f(x) dx under the graph. Now, we pull out the slats. Then, 
the water will form a thin parallelepiped with the same volume (and the same cross- 


sectional area). Its length is b- a. The height was the average water height f(a) in 
the individual lamella. The above formula follows from the area equality. © 


>>» Application: average value of the parabola y = x” in [0,1] 
Solution: 


The area of the parabola is 


1 n+l 4 1 
A= f ede =~ = . 
0 nt+lio n+l 


Thus, the average value is 


1 
1-0 n+1 
Therefore, the parabola y = x” divides the area of the unit square in the ratio 
1: n. In the special case n = 2, this was proven by Archimedes in the 3rd 


century BC using elementary methods! <<< 


>>> Application: centroid of the semicircle 
Calculate the average positive sine value and the centroid of the semicircle. 


Solution: 
With Application p. 395 and Formula (8.18) we find 


— 2 
|sina| = —. (8.19) 
1 
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Now consider a semicircle with radius r and divide it into many small sectors 
which can be approximated by triangles (Fig. 8.31). A triangle’s centroid S; 
has the polar coordinates x and or because the centroid divides the median 
in the ratio 2: 1. 

All triangles have the same area. So, they are “equally significant”. The posi- 
tion vector to the total center of gravity is, therefore, the arithmetic mean of 
the position vectors to the individual centers of gravity. The average abscissa 
is, of course, s, = 0. The average ordinate is the average of 2r sin x because of 
the uniform distribution of the individual centroids, i.e. in the limiting case 
22 4 


——— rx 0.424r. 
3 7 37 


Sy = ge a 


The formula can be generalized in order to find the centroid of an arbitrary 
circle sector with central angle a. We assume that the sector is placed sym- 


metrically to the y-axis and then use the same considerations to calculate 


the average sine value § in the interval |; - a . + |. which, according to 
Formula (8.18), gives 


ate 
be 5, 1 T a T a 2 .a 
s=— [ sinzdz = [cos( cos ( )|- sin —. 
a a 22 2- 2 a 2 


Fig. 8.31 centroid of the semicircle Fig. 8.32 centroid of an arc 


>>> Application: line centroid and areal centroid of a circular arc 
Calculate the line centroid of the arc e in Fig. 8.32. 


Solution: 
The points of the arc are parametrized by 


(r cosy/r sing), y€[a, 5]. 


Then, the line centroid of the curve has the average cosine and sine value 
multiplied by r for its coordinates: 
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L B 1 ; 6 sinB-sina 
COs @ [ cosy dy = B sin | = —_—_—__, 
a a 


~ B-a -a B-a 
5 il ae ; a cos a — cos 3 
In = 1n = = 
© B-a Joa Dae Br-a le Br-a 


® Remark: The centroid of the semicircle (Fig. 8.31) can obviously also be inter- 
preted as the centroid of the concentric semicircle formed by the centroids of the 
individual triangles. ® <<< 


Fig. 8.33 conversion of muscle energy into kinetic energy 


>>» Application: kinetic energy as a definite integral (Fig. 8.34) 


Derive the formula 


mv? 


2 


for the kinetic energy in a translatory motion. 


W = (8.20) 


Solution: 

A body that moves “has” kinetic energy. In order to accelerate a body from 
a speed v1 to a speed v2, energy (work) is required. 

In general, the infinitesimal energy increase dW at each moment is proportio- 
nal to the instantaneously applied force F' and the infinitesimal distance ds 
because of the relation work = force xdisplacement: 


dW = Fads. (8.21) 


Furthermore, force = mass x acceleration holds true where the acceleration a 
is the first derivative of the instantaneous velocity v at the time ft. So, 


F=ma=m eo 
dt 
Applying the chain rule, we find 
du ds 
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ds ' ; 
However, now Te v is the instantaneous velocity, and therefore, 


d 
Fog 


ds" 
So, together with Formula (8.21) 
dv 
dW = Fds= muds =mvdv, 
s 


and thus, 


fr 1 1 
W = [mow = =mvs- =m. 
2 2 
v1 


This energy must be generated in order to accelerate a body from v to v2. 
If the initial velocity v; = 0, Formula (8.20) for the kinetic energy holds. «<< 


Fig. 8.34 kinetic energy Fig. 8.35 potential energy 


>>> Application: potential energy 
Deduce the formula for the potential energy 


W=mgh (8.22) 


by means of integral calculus. 


Solution: 
Again, work = force x displacement. The force is the weight F’ = mg. The 
energy needed during lifting by the small displacement ds is, thus, 


dW = Fds=mgds. 


We lift the body from height h; to height hz and have 
ha ha 
wef mgds=mgs|_ =mghoa-mghyi. 
Uy 1 


For h = hg and hy = 0, we get Formula (8.22). <<< 
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>>> Application: potential energy with a very high difference in height 
Calculate the energy that is necessary to transport a body of mass m from 
the surface of the Earth to a very great height. 


Solution: 

The force which is to be overcome equals the weight F' = mg. Actually, this 
is the attraction force of the Earth onto our body. For small differences in 
height, the acceleration g of gravity only negligibly changes. Yet, “already” 
at a height R = 6,370km, (where R is the radius of the Earth; thus, we 
have doubled the distance to the center of the Earth), only one fourth of 
the attraction force acts according to the findings of Newton: The adjusted 
weight formula is, therefore, 


mg 
F(a) 7 “g2? 


where x is our current altitude measured in Earth’s radii from the Earth’s 
center. Let us represent the height x on the abscissa. 

Now, we think “infinitesimally” as we have frequently done: We lift the body 
a little bit by dz. In the small range of dx, the current acceleration due to 
gravity does not change. (This is what we mean when we say: “On a small 
scale,” things are often easier.) In order to raise the body a little bit by dz, 
we have to perform the work dW = wg -dx (work = force x displacement). 


The tiny amount of work dW can be interpreted as the area of the narrow 
rectangle F(a) x dx. The sum of all the work is the measure of the area 
underneath the curve: 


x2 x2 
W (2)? = [ mg dx=mg [ a? dx. 
xy x xy 


The present integral is easy to solve: 


So, we have 


If we set x1 = 1, that is, if we start from the Earth’s surface and let x2 move 
towards infinity (1/r2 > 0), we get the measure for the work in our adapted 
coordinate system: 

Wrs=mg. 


A unit corresponds to the radius of the Earth. If we want to proceed with 
the international measurement system, we set R = 6,370,000 m, and thus, 


Wr =mgR=FR. (8.23) 
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Thus, it is easy to multiply the force F' by the radius R of the Earth. Whatever 
this value is, it is not infinitely large, and that has a tremendous importance 
for all mankind. We do not need an infinite amount of energy to leave the 
Earth’s gravity field! <<< 


>>> Application: escape speed from the Earth 

Although this example does not require integral calculus, it must be mentio- 
ned here: Use Formula (8.23) to calculate the velocity that a rocket needs in 
order to leave the Earth’s gravitational field. 


Fig. 8.36 Space shuttle with carrier rockets: This object is not intended to leave 
the gravitational field! 


Solution: 

The work to be performed mg R is converted into potential energy in the 
physical sense. If the object (the space capsule) is dropped again, this form 
of the energy changes into kinetic energy mv? /2. The theoretical impact 
velocity (the air resistance only plays a role in the very last phase of the 
entrance into the dense atmospheric layers) can be calculated as: 


2 
mgR _ >2gR=v' > 2gR 


k k 
98 205 -6.37- 108m # 11.2~— » 40,000——. 
s Ss h 


As in “usual” free fall, v does not depend on the mass. Conversely, v is 
the critical velocity that the rocket must reach so that it does not become 
“trapped” by the Earth. 


® Remark: Of course, “one has to keep some fuel in reserve”, in order to be able 
to change the course in space at a later point. For flights to the Moon, the space 
capsule is also being “captured” by the gravitational field of the Moon. © <<< 
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Volume and surface area 


The following practical and often applicable theorem is about 400 years old 
and comes from Bonaventura Cavalieri: 


Cavalieri’s principle: \f the areas of all parallel planar cross-sections of two 
bodies are equal, then the bodies’ volumes are equal. 


Proof: Consider a “stack” of thin cylindrical disks of arbitrary cross-section (Fig. 
8.38). The volume does not change when the horizontal position of the panes are 
interchanged. Through refinement (limit process), this fact also applies to “infinitely 
thin” disks (one speaks of “elementary disks”). ©) 

This can analogously be applied to planar figures: 

If the chords of the intersections of two figures with straight lines are of the 
same length and if this is true for all chords on parallel lines, then these two 
figures have equal areas. 

The following modification of Cavalieri’s principle is immediately obvious: Jf 
two bodies can be clamped in between two parallel planes so that their cross- 
sections are of the same area on average, then the two bodies have equal 


volumes. 
= a 
“= Ba Sie. = 


Fig. 8.37 Archimedes’s method Fig. 8.38 Cavalieri’s principle 


>>> Application: oblique prisms and pyramids 
Following the above, oblique prisms and pyramids have the same volume V 
as straight prisms and pyramids: 


1 
V = base area x height, V = = base area x height. (8.24) 


By refinement, the theorem also applies to cylinders and cones with arbitrary 
cross-sections. <<< 


>>> Application: volume of a cone 

Show that the volume of an arbitrary cone (pyramid) is one third of the 
volume of a corresponding cylinder (prism) (Formula (8.24)). 

Proof : It is sufficient to consider the respective prototype of the cone with a height 
of 1. Each similar cone results from the prototype by means of scaling, i.e. a dilation 
from a center. 
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Let G be the measure of the base of the prototype. Let us consider the cross-sectional 
area G of the cone at the distance «x € [0,1] from the tip. It changes quadratically 
with x: 


G,=27-G. 


According to Application p. 398, the function y = x? has the average value 7 = 3 
in the interval [0,1]. Thus, the average cross-sectional area is one third of the base 
area G. The volume of the cone, therefore, corresponds to that of a cylinder with the 
same height and a similar base but reduced to a third, or the volume of a cylinder 
with the same base and a third of the height, see also Application p. 425. Q «a 


>>> Application: volume and average cross-section of a sphere 

Show that the volume of a sphere with radius r is equal to the volume of a 
cylinder of revolution with radius r, = \/2/3 r and height 2r, and use this to 
calculate the “average cross-section of the sphere”. 


Solution: 
The cylinder’s radius rz is obtained by equating the volumes of sphere and 


cylinder: 
dn 2 
73 =r? Or > ( Ver 
3 3 


Then, the average cross-section is obtained by reversing Cavalieri’s principle 
2 


with Q = are =§ mr? and is, therefore, 2/3 of the area of a large circle. 
The volume of a cylinder of revolution circumscribed to a hemisphere is, thus, 
3 times as large as that of the hemisphere. Subtracting a cone of revolution 
from this volume, as in Fig. 8.37 (with one third of the cylinder volume, or 
half the volume of the sphere), the new structure has the same volume as the 
hemisphere. This discovery was made by Archimedes, who was so proud of 
it that he had it inscribed onto his gravestone. The proof, which anticipates 
the principle of Cavalieri(!): In the layer plane of height z, the sphere has a 
cross-section of area a (r?— 2). The circular ring cut out of the cone-shaped 
cylinder of revolution has outer radius r and inner radius z (the cone is 


inclined at 45°) and is, thus, the same surface. da 


>>> Application: centroid of a hemisphere 
Show that the “auxiliary solid” in Fig. 8.37 (the cylinder with a conical drill 
hole) has the same centroid as the hemisphere of equal volume. Compute it. 


Solution: 

We divide both solids into arbitrarily thin layers parallel to the base circle. 
According to the preceding considerations, each layer has the same cross- 
section, and thus, the same volume or the same mass which we may think 
of as concentrated in points at the same height on the perpendicular axis of 
rotation. Instead of layers, we can, thus, think of particles of equal mass at 
the same height, which together have a common centroid. We calculate this 
centroid on the “auxiliary solid”, which is produced by drilling out a cone 
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of revolution (negative mass!) from the cylinder of revolution. The cylinder 
of revolution has mass m, and the cone of revolution has the negative mass 
Mg = —m ,/3. The centroid of the cylinder lies at height r/2. For the cone, it is 
at height 3r/4 (the centroid divides the height in the ratio 1:3, Application 
p. 186). Using 5.15 we now get the height of the total centroid 


1 ( rom, 3r ) 3r 
my- : =—, 
2m" 2 3 8 <<< 
>>> Application: surface area of a sphere 

Calculate the surface of the sphere from its volume. 


Fig. 8.39 How to calculate the surface of the sphere from the volume. 


Solution: 

We choose many arbitrary points on the sphere and connect them, forming a 
triangular mesh (Fig. 8.39). Each triangle, together with the sphere’s center, 
forms an arbitrary tetrahedron, to which the volume formula applies: 


1 
Vietrahaaven = ae x height. 


Approximately, the volume of the sphere is equal to the total sum of the 
volumes of the tetrahedra. If we further refine the triangular mesh on the 
sphere, then the height of all the tetrahedra converges towards the radius of 
the sphere and the sum of the base areas towards the spherical surface S. 
Thus, 


dn r 2 

Vsphere = Y Vietrahedra = ar S- 3 > S=4nrr-. 
Accordingly, the surface area of a hemisphere is twice as large as the surface 
area of its base circle or it is just as big as the surface area of the circumscribed 
cylinder of revolution. << 


>>> Application: volume of a torus (Fig. 8.40) 
Show that the volume of a ring torus with radii a and b equals the volume 
of a cylinder of revolution with radius b and height 27 a. 
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Solution: 

The torus can be clamped between its “double tangent planes” on which it 
rests. A section parallel to these planes at the arbitrary height z yields an 
annulus with radii 71,2 = a+ 9 (with @ = Vb? — z*) whose area equals the area 
of a rectangle with sides 2am and 2a: 


A=n(rj-13) = 7 ((at 0)* —(a—0)*) = dar. 


Fig. 8.40 circular layers of a torus 


The length of the meridian’s chord at height z is precisely 20. This rectangle 
is interpreted as the planar section of a cylinder of revolution with a plane 
parallel to the axis. <<< 


Fig. 8.41 bending the torus into the cylinder of revolution 


Thus, according to Application p. 406, a torus can be “straightened” (Fig. 
8.41) without changing its volume. (It is only an imaginary bending. In prac- 
tice, cracks and deformations would occur.) In this case, the circular path of 
the center of the meridian circle is stretched into a straight line segment of 
equal length. Even the surface of the torus coincides with that of the cylinder 
of revolution because the sum of “surface elements” of the torus corresponds 
to the symmetric surface elements of the cylinder at the same height. 

This line of thought can be generalized: The rotating meridian can have any 
shape. The centroid of the meridian surface takes the role of the center of 
the circle. The following useful formulas, proved by Paul Guldin around 1600, 
were documented almost contemporaneously with the principle of Cavalie- 
ri. However, these rules can already be found, although without conclusive 
evidence, in the “Collectiones” of the Greek mathematician Pappos of Alex- 
andria, who lived 1,300 years prior! 
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Guldin’s first rule: The area of a surface of revolution is equal to the product 
of the length of the meridian curve m lying on one side of the axis of rotation 
and the length of the trace of the line centroid of m during a full rotation. 


Guldin’s second rule: The volume of a solid of revolution is equal to the surface 
area of the planar meridian lying on one side of the axis of rotation and the 
length of the trace of the surface centroid of the meridian during a full rotation. 


Thus, it is possible to “bend” each solid of rotation into a cylinder without 
changing its surface or its volume. In this case, the circular path of the line 
centroid or the areal centroid of m is stretched into an equally long straight 
line segment. The exact proof by means of integral calculus will be omitted 
here. 


>>> Application: surface area and volume of a torus 

One computes the formulas for the surface area and the volume of the ring 
torus (a> b) using Guldin’s rules. 

Solution: 

The meridian m is a circle with radius b. For a > b, m lies entirely on one 
side of the axis of rotation. The line centroid and the areal centroid of the 
meridian circle coincide with the center of the meridian. The path of the 
centroid in both cases is of length 27 a. In other words, 


S=2nb-2na, V=nrb?-Qra. <<< 


>>> Application: volume of a sphere 
Use Guldin’s second rule in order to verify the formula for the volume of the 
sphere, which has already been used several times. 


Solution: 

The sphere is generated by the rotation of a semicircle m (“circle of longi- 
tude”). The surface of the semicircle is tn Application p. 398, we have 
calculated the centroid of the semicircle. Its distance from the axis of rotation 
is —r. The distance of the centroids is, thus, 27 — r = 3 r, and the volume 


1 . ae 37 
of the sphere, according to Guldin, is 


Vase arte 2 y8, (8.25) 
ddd 


>>> Application: surface area of a spherical segment 
Calculate the surface of a spherical segment (and, as a special case, the total 
surface of the sphere) with Guldin’s first rule. 
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Fig. 8.42 surface area and volume of a spherical segment. 


Solution: 
A spherical segment is created by the rotation of the circular arc 


(r cosy/r sing), pela, 8] 


about the y-axis. The distance from the circular arc’s line centroid to the 
axis of rotation is given in Application p. 399: 


sin 6 - sina 
B-a 


The path of the centroid is 27 times as long. The length of the arc is r (G-a). 
Thus, according to Guldin’s first rule, we have 


sin 6 - sina 


S=20nr -r(8- a) = 2nr*(sin GB - sina). (8.26) 


B- 


oes 


Fig. 8.43 approximation of a sphere by means of parts of cylinders of revolution 


An interesting application of this formula is given in Application p. 425. 
The special case a = = B= . yields the well-known formula for the surface 


of the entire sphere: 
S=4nr?. (8.27) 


® Remark: This formula can also be found heuristically as follows: The sphere is 
approximated by parts of n half cylinders of revolution (radius 7). These intersect 
each other along half-ellipses. In the limit n > oo, one obtains a smooth spherical 
surface. The parts of the cylinders of revolution can be developed and transformed 
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into lens-shaped structures: The elliptic oblique sections show up as sinusoids. The 
length of such a lens is half the circumference of the sphere (J = 77). The width is 
— for a large n — an n-th of the circumference of the sphere (b = 2a7r/n). According 
to Formula (8.19), we can convert the lens into a rectangle of width 2b 7 a The 
n subsections have a total area of n-ar- # =4rr?.® <<< 
Conversely, Guldin’s rules can be used for the computation of areal centroids. 
The distance s of the centroid of the semicircle from the axis of rotation can 
be calculated from the volume V of the sphere and the area A of the semicircle 
(see Application p. 408): 


Thus, a two-dimensional problem is solved by “escaping” into the third di- 
mension. We have already made use of this as we have interpreted the curves 
of degree 2 (which are planar anyhow) as planar intersections of a cone of 
revolution in three-dimensional space. 

Up until now, we have included integral calculus in the calculation of volumes 
and areas insofar as we have used the principles of Cavalieri and Guldin’s 
theorems, whose exact proofs require integral calculus. It is also possible to 
calculate the volume of surfaces of revolution or their surface area directly: 


If a function graph y = f(x) (a € [a, b]) rotates about the x-axis, then the 
surface area S' and the volume V of the emerging surface and solid of revolution 


are 
b b 
S=2n ff yV lt+y”? dz, ven f y? dx. (8.28) 


Fig. 8.44 area of a surface of revolution 


Proof: We cut the surfaces of revolution into thin slices (Fig. 8.44). These are 
truncated cones with height dz. Such an “elementary frustrum of a cone” has the 
surface area dS. In order to calculate it, we need the “arc element” ds of f(x). 
According to Pythagoras’s theorem, we have 


dy \” 2 
ds = \/ da? + dy? = \ 1+(2) dx=\/1+y" dz. (8.29) 
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The elongated chord (in the limit the tangent) from the curve point P to the 
x-axis (intersection point T) is a generator of the cone. The tip of the frustrum 
of the cone is denoted by T. The distance TP is denoted by t. Then, we have 
dS = n[(y + dy)(t + ds) - yt] = n[yds +tdy + dyds]. Since y : t = dy: ds, the 
equation tdy = yds holds, and we have dS = m[2yds + dyds]. The limit procedure 
with dx > 0, and consequently, dy > 0, ds > 0, causes the product dyds “to be 
of higher order smaller” than dy and ds, and can, therefore, be neglected. Now, we 
have dS =27yV1+y’ 2 dx, and integration provides the above formula for 9. 

It is much easier to derive the formula for the volume: The volume dV of the 
elementary frustrum of a cone for an “infinitely small” dz is not to be distinguished 
from the volume of an “elementary cylinder”, and thus, 


dV =ry' dx 


from which the formula already follows by integration. © 


Fig. 8.45 spherical cap 


>>> Application: volume and area of a spherical cap (Fig. 8.45) 
Calculate the volume and the area of a spherical cap with height h. 


Solution: 
For the meridian of the sphere, we have 27+ y? = r?, and thus, y = Vr? — x2. 
The volume is computed using Formula (8.28): 


V= rf y° dx = nf (r? —2”) da = ar 22-2] 


r-h)? _t 
5 | 3h (3r —h). 


Furthermore, the surface is obtained from 


S=2n f UV ity de. 


-2 
From the equation of the meridian, we get y’ = ae , and thus, 
Vr? — a? 


S=2n fo nts (2) aenre f =. 


=a[r8 s r(r aus 
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Tr r 
= 27 rdx = 2nra| =2rrh. 
r-h h 


We compare this result with that of Application p. 408: There, we had the 
special case with (6 = . and 


S =2nr?(1-sina). 


In fact, for h=r-—rsina, the formulas agree. <<< 


>>> Application: capacity of a drinking glass 

A drinking glass has the shape of a surface of revolution which is generated by 
rotating a sinusoidal curve. Calculate the maximum volume of the drinking 
glass for a = 3.2cm, b= 0.8cm, and h = 12cm (Fig. 8.46, left). 


| 
‘a - D Sin anit X 


y= 


Fig. 8.46 rotation of a sinusoidal curve (drinking glass) 


Solution: 5 
a 
The sinusoidal curve can be described by y = a—- 6 sin - x. For the volume, 


h h 2 
v-nf ypdr=n f (a-b sin“) dx = 
0 0 h 
h h h 2 
-n[a f dx ~2ab [ sin oda +P f sin? <r de] = 
0 0 h 0 h 


h Qn jh | 2 
= 1 [a?h + 2ab— cos a wf 5 (1 c0s22} de| = 
2 h lo 0 2 
>>» —"’ 


we obtain 


h 
_—— 
0 according to Formula (4.15) 
2 h 4 b? 
= r[a2n+0+0?2 mes i COs =r de | = mh{ a? + =|. 
2 2 J0 h 2 
0 


The interim calculation requires one transition to the double angle. For the 
given values, V » 400 cm? = 0.41. 


® Remark: Calculating the integral was quite tedious. We will soon learn an appro- 
ximation method with which we can calculate the value as accurately as possible 
with a computer. Then, the obvious question can be answered: Where should the 
marking for 0.251 be placed? ® <<< 
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8.7 Numerical integration 


We have seen that integral calculus plays an important role in applied ma- 
thematics. There is often a problem in practice: The inability to express an 
antiderivative in terms of elementary functions. 

However, each definite integral can be evaluated. For this, Kepler provided a 
decisive contribution with his so-called “Fassregel” (“barrel-rule”), sometimes 
also referred to as Simpson’s rule: 


a+b 


Kepler's rule: [ f@) are — [F@) 1F( 5 )+F@)]. 


F((a+b)/2) 


fla) f((a+b)/2) 


(atby/2 


Fig. 8.47 Kepler’s rule: area underneath the curve and corresponding volume of 
the solid of rotation 


Proof: The proof is merely sketched: We replace the meridian curve by a parabola 
with the equation y = a2 x? + a,x + ao (see Application p. 33). The deviation will 
remain small if the curve looks parabolic in any case — like the meridian of a barrel 
(Fig. 8.47). 

In order to shorten the calculation somewhat, we restrict ourselves to the parabola 
y = x” +c, which can always be achieved by shifting and scaling. 

The average value of this prototype equals 


Te- + fe +c) da = ops + c(b a)| = 


Bb? +ab+a? a? +b? + (a+b)? +6c 
ee 


3 6 


Now, we evaluate the meridian and get 


eae (a+b)? - 
2 4 

a+b 
2 


f(a) =a’ +c, f(b) =? +e, f/ 


=a? = f(a)-c, b? = f(b)-«, (a4 b)? =4 f/( ) Ac. 


With 


Fe) = [ra +4) + 10], 
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we find the above rule because A = (b-a)f. © 


At first, Kepler’s rule was only used to calculate surfaces. However, Kepler 
computed the volume of the barrel by replacing f(a), f(b), and f((a+b)/2) 
with the areas F(a), F'(b), and F((a+b)/2) of the cross-sections. Then, the 
rule also works for the function F(a) = 7 f?(x) and yields the volume as a 
result — only the function F(a) has a different meaning, and its graph does 
not correspond to that of the meridian (Fig. 8.47). 

According to what has just been said, Kepler’s rule actually works for any — 
hence also non-rotationally-symmetric — solid, provided that the areas of the 
cross-sections on the boundaries and in the middle are known. 


>>> Application: giant wine barrels 
Approximate the length of the huge wine barrel (Fig. 8.48) in the Croatian 
winery Kutjevo near Velika. 


Solution: 

We do not need to estimate the volume, because the barrel clearly shows the 
content: 53,520 liters, i.e. about 54 m?. The two ladies in front of the barrel 
are 165 cm. Since the front diameter of the barrel is about twice as large, 
we have a cross-sectional area of about 8-9 m? at the front and rear. The 
average cross-section should then be 10-11 m?. Thus, we have the following 
approximation (excessively exact computations would be “non-mathematical” 
because the chain is as strong as its weakest link): 


b-a 
6 


54 m? = 


(8.5+4-10.5+ 8.5]m? > b-ae5.4m. 


® Remark: The largest wine barrel in the world is in Heidelberg and has four times 
as much capacity. The lengths are accordingly multiplied by /4 * 1.6. ® <<< 
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>>> Application: How much blood fits into a mosquito? 

A mosquito has just refreshed itself on my left hand (Fig. 8.49, such experi- 
ments should only be made in areas free of malaria). How much blood was 
drained from me? 


The shape of the “blood tank” (Fig. 8.49 below) is ideally suited for the Kepler 
formula. If we take the body as a symmetrical spindle, the cross-sections at 
the two ends are zero. The length b- a is about 10 mm. The central cross- 
section with a diameter of 3 mm is about 7 square millimeters. Thus, the 
volume is 

_ 10mm 


V 


[0+ 4-7+0]mm? « 46 mm’. 


After 20 pricks, 1 cm? (1 milliliter) of blood has been removed. 


® Remark: In comparison with the initial phase of the prick (Fig. 8.49 above), the 
final blood-fattened mosquito is somewhat grotesque. However, it is not because of 
the loss of blood that mosquitoes are so unpleasant. It is rather because of the anti- 
coagulant that the mosquito injects and the diseases that are transmitted: Mosquito 
bites are painful and a danger for humans and animals. 1.5 million people die each 
year from malaria. ® <<< 


>>> Application: How useful is Kepler’s rule? 
Compute the error that is made by Kepler’s rule when computing the volume 


of a sphere, a cone, and a cylinder. 

Solution: A 

For the sphere, we want to estimate [ n(r? — x”)dx. Kepler’s rule yields 
=f 


2 
V= ~ (0+ 47? +0) = r°. 


That is the exact formula (Application p. 408)! 
For the cylinder of revolution, we have 


h 
V= - (rr? +4ar?4+a77) =29r7h, 


and for the cone of revolution, Kepler’s rule yields 


h ry\2 1 
V=-—(0+47 tar?) = —r7h, 
6 ( (5) ) 3 

again no deviation from the exact result (Application p. 404)! 

Additional task: 

Motivated by these results, we will test two more solids: The spherical cap 
gives the approximation 


ah 
V= [0 An(r? (r )?) a(r? G h)’)| = 7 (3r-h), 
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Again, this coincides with the exact formula (Application p. 411)! 

For the torus, the exact formula is V = 277 ab? (p. 408). The approximation 
gives 

167 


2b 
Ve 5 (0 Ax ((a +b)? - (a~b)?) +0] = 5 a. 
This time, there is a deviation of about 18%. <<< 


Kepler’s rule uses only three curve points (or cross-sections) and is, as we 
have just seen, often a sufficient approximation, especially for the calculation 
of the volume of simple solids of revolution. When calculating areas in the 
plane, it is also surprisingly accurate for parabola-like functions. 

For more complicated curves or solids, 2m +1 interpolation points are se- 


b-a 
lected equidistantly with a distance of h = —— to their neighbors, and one 


obtains m adjacent approximating patabdlas, The corresponding partial are- 
as are evaluated with Kepler’s rule and summed up in order to get the total 
area. In this case, the boundary values f(a) and f(b) occur only once. The 
“odd vertices” occur four times. The “even vertices” but two times (because 
there are two parabolic arcs there). The corresponding formula is used by 
computers: 


Simpson's approximation formula: 


fl Flr)ae © [f(a) +4 flat h) +2 f(a+ 2h) +4 f(a+ 3h) bo 


b-a 


... +4 f(b- 3h) +2 f(b-2h) +4 f(b h) + f(b) | with h= >. 


>>> Application: numerical integration 


Lod 
Calculate [ aa 

9 l+2? 
formula at five supporting points. 


once exactly and once by means of the approximation 


Solution: 
The exact solution 


1 
d 
ra H <2 gichan i= Breas a 7e5e0s... 
J 1+ 2x? 4 


is known in this case. Now, we use the five evenly distributed knots: 


hg — (OM (Gaz): (515) Gala) (v3) 


8.7 Numerical integration 417 


With Simpson’s rule, we obtain: 


1 


rs 


l+22 12 


a 5(} a se tg] 0.785392... 
17°52 2 


0 05 1 
Fig. 8.50 five knots for Simpson’s formula 


We see that this “harmless” function already gives good results with only a 
few supporting points. 

Using 7/4 = arctanl > a = 4 arctan1, the number 7 can be computed 
very precisely on a computer (7 is not predefined in every programming 
environment!). The computer calculates the arctan by means of expansion 
into a Taylor series (power series expansion). <<< 


>>> Application: the area underneath the Gaussian curve 
(Application p. 371) 
This area plays a fundamental role in probability theory (Chapter 8): 


2 


1 fo 


The only problem is that this particular integral can only be evaluated nume- 
rically. Moreover, it is an indefinite integral where even thousands of support 
points may deliver inaccurate results. One finds a way out of this dilemma. 


Solution: 


NO 


x 


1 
The curve e 2 optically resembles the function ie (Fig. 8.4). It has 2 = 0 
x 


as a maximum and is “almost zero” for |2| > 5: f(-5) = f(5) * 3-10-°. Between 
—oo and —5 the area is » 3-107" and thus it is negligible — the same applies 
to the area between 5 and oo. Practically, the entire area (0.9999994 of 1) is 
in the interval [-5,5], and there, only a few support points are sufficient. 

A completely different solution is obtained as follows: According to Formu- 


la (8.5), the power series expansion of the function y = e™” is as follows: 


2 2 xv xv x 
2! 3! «A! 
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We need to compute 


4 6 .8 
b —x? = b 24 . ne — 
{,@° de=], ( Be a a .) do 
ee 2 x oie ‘ 
=l|r + wel. 
3° 5-2! 7-3 9-4) | 


Now, we can prove that this series expansion converges for every x, although 
it converges very slowly for large |x|. For |x| < 1, the function converges very 
quickly, and one can also calculate probabilities with it. <<< 


>>> Application: clothoid or Cornu’s spiral (Fig. 8.51) 

In road construction, you need curves which continually increase or decre- 
ase the curvature, so that you can turn the steering wheel uniformly when 
going through the curve. These curves are given without proof in a parame- 
tric representation (parameter t) by the following integrals, which cannot be 
computed by means of elementary functions in a closed form: 


t ; t 4 
et) = Le sp az, y(t) = = sp dy. (8.30) 


The integrals can be calculated using Simpson’s approximation formula. 


Fig. 8.51 clothoid Fig. 8.52 turbulence over the Atlantic 


® Remark: Curves that almost resemble the form of clothoids are also found in 
nature. Fig. 8.52 shows that sometimes such forms are found even on weather maps, 
as if turbulence “evenly twists the steering wheel’. ® daa 


Obviously, Simpson’s formula also works if the function to be integrated is 
not given by mathematical formulas, but only by an odd number of evenly 
distributed measured values. If the measured values are unevenly distribu- 
ted, the curve is approximated by spline curves in order to achieve an even 
distribution. 


>>» Application: average dive depth, air consumption (Fig. 8.53) 
Dive computers record the current depth of the diver at regular intervals and 
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calculate, among other things, the air consumption or the nitrogen content 
in the diver’s blood. For a given “time-dive-depth diagram” d = d(t), we give 
the average dive depth D and deduce a formula for the consumption of air. 


T 


Fig. 8.53 fast descent, slow ascent, safety stop 


Solution: 

Fig. 8.53 shows a typical “dive profile’: The diver searches as quickly as pos- 
sible for the maximum depth and remains there for a certain time (“bottom 
time”). Then he/she ascends not too fast (maximum 15-20 m per minute!) to 
about 5m depth, and remains there for several minutes before finally leaving 
the water. 

Let T be the total dive time. The average dive depth, according to Formu- 
la (8.18), is 


D = d(t)d 

= t) dt. 8.31 
ii js © ae 

Since the external pressure increases by 1 bar every 10m, a pressure of 1 + 


— bar prevails at a depth of dm. The valve of the tank with compressed air 


must push the same amount of compressed air into the lungs so that they do 
not collapse due to external pressure. Thus, air consumption is proportional 
to external pressure. Within time 7’, Vo = cT is the amount of air consumed 
on the surface. In each small time interval dt, dVo = cdt of air is consumed. 
In the same time interval of dm, 


t 
dV =c|{1+—~ oo) ) dt 
10 
amount of air is required. Thus, the total air consumption equals 


V= of (1+ 92) a=e [Tare = Patt) at, 


and therefore, using Formula (8.31), 


VeeT+<7D=cT (1+7)- Vo (1+ +z). 
10 10 10 
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8.8 Further applications 


>>> Application: calculating limits by means of series expansion 
Determine the following limits: 


Solution: 6 
All three expressions are indeterminate at x = 0, namely “—”. We use the 


series expansion which enables us to divide by powers of x and get “reasonable 
values”: 


x 7 
roe e-—t. a) 
lim —— = lim 3! =1 salle si 
z>0 @¢ x0 x 3! 
x2 ge gt 7h 
d i +11 + + dl 
— +cCosx a 2 ( 2 6A! 7 
40 x 10 az 
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>>> Application: period of oscillation of the pendulum (Fig. 8.54) 
Show that the period of oscillation T » 27\/L/g is independent of the opening 
angle » for small y. 


Solution: 

At point P, the normal acceleration is gsinw. This reminds us of the rotation 
of a point Po about a fixed point — even there, we have the same acceleration 
distribution (Application p. 274). The oscillation duration of P corresponds 
to the orbital period of the point Pp at the auxiliary circle c. Its radius is 
r= Lsiny. Thus, the circumference of c is U = 27 sing L. 

The path velocity v at Po is equal to the velocity of P at the lowest point. The 
height difference of P is A = L(1-cosy). At the lowest point, its potential 
energy is completely converted into kinetic energy: 


2 
mg A=m-> = v= /2gh = /2(1- cose) Lg. 
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This means that we have the desired orbit time 


_U 27 L sing sin y L 


a \/2(1-cosy) /Lg a ,/2(1 — cosy) | g 


wl 


Now, we use the approximations from Application p. 383, according to which 
both siny and \/2(1— cosy) agree quite well with y, so that the fraction is 
approximately 1. For a small y, T is independent of y. The following table 
provides information on the quality of the approximation: 


y | radians Ga error [%| 

5° 0.09 0.99905 0.1 

10° 0.17 0.99619 0.4 

A 15° 0.26 0.99144 0.9 
20° 0.35 0.98481 1.5 

25° 0.44 0.97630 2.4 

30° 0.52 0.96593 3.4 

35° 0.61 0.95372 4.6 


Py ! 
Fig. 8.54 pendulum 


A comparison with the table in Application p. 383 shows that, through the 
two approaches, the errors even “cancel” each other out to some extent. The 
error due to the approximation of the fraction is virtually negligible for y < 
15° (long pendulum with a small deflection, see also Application p. 20). <««« 


>>> Application: average brightness of the Moon (Fig. 8.55) 
Calculate the average brightness of the Moon in the transition phase from 
half moon to full moon (new moon). 


Solution: 

The Moon is illuminated by the Sun. Because of the Moon’s spherical shape 
and the nearly parallel light rays, exactly one half of the Moon is illuminated. 
The boundary of the shadow is a great circle of the Moon, which, viewed from 
the Earth, appears as an ellipse e whose principal axis divides the outline of 
the Moon into two equal halves (the principal vertices of the ellipse are at 
the contour). The illuminated lunar surface, thus, consists of the area of a 
semicircle together with half the area of the ellipse e. Looking at a full moon 
period (it takes 29.53 days to go from one full moon to the next), the average 
illuminated area is equal to the area of the half moon for “symmetry reasons”. 
This does not mean that between half moon and full moon (new moon) 3/4 
(1/4) of the surface is illuminated on average, as we shall see soon: 
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Fig. 8.55 full moon, waning moon Fig. 8.56 rotation of the Moon 


Let a be the radius of the contour circle and b be the semi-axis of the elliptical 
margin of the half moon. Then, we apply Formula (8.17) to the illuminated 
area: 
A= a4 ~ab= 7 alas). 
2 2 2 


Since the entire lunar disc has area A, = ra”, the portion of the illuminated 
area S' (and thus, the brightness h) equals 

je A _atb 5(1+2). 

Ao 2a 2 a 

Simplifying, we assume that the lunar orbit is circular and the rotation of the 
Moon is uniform (which is quite close to reality). Let T be the time between 
the half moon and full moon (T' * 7d). In this time, the angle of rotation is 
about 7/2 (that is, 90°). Yet, it must be remembered that the Earth and the 
Moon turn around the Sun during these 7 days by about 7° (in Fig. 8.57, 
the Moon’s rotation is illustrated by 12 interpolations). Since the angle does 
not enter the following considerations quantitatively, nothing changes in the 
“average result”. 
After t days, the angle of rotation equals y = aE. The semi-axis 6 of the 
ellipse is obviously proportional to siny (Fig. 8.56) and the current lunar 
brightness differs from the brightness of the crescent by the value Ah = 
+ sing. According to Formula (8.19), the average sine value is 2 and Ah = 


1 


= » 0.32. In the week when the half moon waxes to the full moon, the Moon 
has an average of about 0.5 + 0.32 = 82% of its maximum brightness. In 
the week when the half moon wanes to the new moon, it has only about 
0.5 — 0.32 = 18% of its maximum brightness. 

® Remark: In fact, from one day before the full moon until the day afterwards, there 
is still the subjective impression that it is a full moon. The “true” full moon is best 
recognized by the fact that on this day, the Moon rises from the horizon at roughly 
the same time as the Sun descends on the opposite side. In the preceding days, the 
Moon rises about 50 minutes earlier per day. That is why divers like to do their 
“night dive” a day or two before the full moon just after sunset. The diver can find 
a perfectly acceptable minimum amount of natural illumination. 


The inclination of the half moon, which depends on the geographical latitude, is 
explained in a simple, geometric manner: The Sun and the Moon move relative 
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to one another in a single plane. Both rise and descend in moderate widths at 
a shallower elevation than in tropical regions. In the equatorial region, the setting 
waxing moon is rather lit “from below”, and the crescent moon “is horizontal”. In the 
southern hemisphere, the Sun apparently travels in the opposite direction, because 
at noon it is generally in the North. Thus, the crescent points in the direction 
opposite to where it points in the northern hemisphere. @ 


a 3 


Fig. 8.57 Moon phases in theory and in practice 


® Remark: The computer simulation shown in Fig. 8.57 shows a cycle of approxima- 
tely four weeks which takes place in the northern winter (at the North Pole, there 
is deep polar night). It can be seen that the angle between the Earth’s axis and 
the direction to the Sun or the Moon is approximately the same for the new moon 
(position 0). As a result, the Sun and the Moon migrate on nearly identical paths 
in the firmament (only in this phase can a solar eclipse occur). About 15 days later 
(between positions 6 and 7), there is a full moon. Now, the angle between the axis 
of the Earth and the direction to the Moon is quite different from that between the 
axis of the Earth and the direction to the Sun. The lunar path on the firmament 
will, thus, significantly deviate from the Sun’s orbit (much steeper in the North). 
In the winter of 2005/2006, the Moon reached a particularly high elevation angle 
(recurring every 18.2 years) because of the extreme lunar inclination of 5.2%. ® 


® Remark: In the south-west of Antarctica, there is a period of several months of 
polar night, in which the full or almost full moon illuminates the arduous rou- 
te of emperor penguins (Fig. 2.42) marching to their breeding grounds (https: 
//en.wikipedia. org/wiki/March_of_the_Penguins). One more remark: How in- 
variably beautiful must the “crescent Earth” appear from the Moon (Fig. 7.28)! 
Since the Moon always turns to the same side, there is no “rising Earth” or “setting 
Earth” on the Moon. On the side facing us, the Earth is always at the same place 
in the firmament because of the elliptical orbit, and there it always undergoes the 
“Earth phases”. @ <<< 
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>>> Application: effective current 

Meters for currents measuring alternating currents do not indicate the ma- 
ximum current Imax, but the square-average current J,¢ which is also called 
the effective current. Calculate this for I = Imax coswt (t is the time, w is the 
angular velocity of the rotating current-generating coil). 


Fig. 8.58 the average value of y = (m cos)? equals m?/2 


Solution: 

We set x = wt and m = Imax. Then, we get I? = m?cos* x. The average 
function value of J? in the interval [0,7] is 4+ fo’ I?dx. However, the ave- 
rage value can also be calculated immediately without using the formula 
cos” x = $(1+cos 2x), which can be derived from the addition theorems (see 
Formula (4.15)). The function y = m? cos? x, therefore, has a sinusoidal graph 
(Fig. 8.58) which has the z-parallel y = m?/2 as the center line. Thus, its ave- 
rage value in the interval in question is m?/2. Io is defined as the square 


root of this average value, and we have 


2 


L, max 


5 


>>> Application: fat deposits in the hip region 
In German, the fat belt around the hip is colloquially known as a “swim ring”. 
Estimate its mass. 


Fig. 8.59 various swim rings (left: toroidal surface with a meridian ellipse) 
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Solution: 

We shall approximate the object in question by a surface of revolution with 
an ellipse for its meridian curve (Fig. 8.59, left). Its meridian section is a 
half-ellipse with a height of 20 cm and width b. The area of the half-ellipse is 
107 b/2cm? » 15bcm?. The ideal hip circumference (without tire) is 80 cm. 
Then, according to Guldin’s rule (Application p. 408), the tire has a volume 
of 1200bcm?. Each additional centimeter, thus, stores about 1 kg of fat (fat 
is lighter than water). 


® Remark: In order to lose 1 kg of fat “forever”, one has to eliminate about 40,000 
KJoule (or 10,000 kilocalories). People consume about a quarter of that in a day. 
Theoretically, one would have to drink water for four days to reduce b by one 
centimeter (in fact, decreases or increases primarily occur in the hip area). It is best 
to reduce calorie intake slightly over a longer period, because it is permanent. ® 

The maximum buoyancy of the real floating tire in Fig. 8.59, on the right, 
can be estimated quickly: it is a (slightly flattened) torus. At a diameter of 
10 cm, the area of the meridian section is approximately 25 7 » 80cm”. With 
a mean diameter of 35 cm (circumference 357 * 110 cm), we have a volume 
of about 9 liters, and thus, in order to submerge the buoyancy completely, 
one would need about 90 Newtons. This is easily enough to keep the head of 
a child completely above water. <<< 


>>> Application: volume of a pyramid’s frustrum (Fig. 8.63) 
Even the ancient Egyptians used, without proof, the formula for the volume 
of a frustrum of a pyramid with a square base (lower edge length a, upper 
edge length b, height h): 
h 
V= a (a? + ab+ 0’). 
Prove the formula using Cavalieri’s principle. 


Proof: The cross-section of the pyramid changes quadratically with the side length x: 
A(x) = a7. The average value of the function A(x) is (see Application p. 398) 


| 


1 a 4 1 2) ab at+ab+b? 
f xu dx = = 
a-b Jb a-b 3 


b 3(a—b) 3 


The capacity of the volume-like prism is obtained by multiplying A by the height h. 


® Remark: Egyptian (and Babylonian) mathematicians did not give rigorous proofs. 
In this way, formulas were used which are only correct in special cases. For example, 
they had an incorrect formula for the calculation of the area of a general quadrangle 
of which the sides are known and which is not yet clearly defined. @ ©) 

The advantage of this consideration is that the proof can be applied directly 
to any truncated pyramid or truncated cone, see also Application p. 404. «<< 
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>>> Application: navigation problems with a GPS 

When we are in a flat terrain, there are always “more than enough” satellites 
that can be used for navigation using a GPS. However, when hiking in the 
mountains, it is often the case that satellites are “visible” only from a certain 
angle of elevation y. How many satellites are “visible” from a flat terrain? 
What is the critical inclination angle y? 


Fig. 8.60 navigation and... Fig. 8.61... houses as obstacles 


Solution: 

Let R= 6,370km be the Earth’s radius. The 24 satellites orbiting the Earth 
at an altitude of h = 20,000km (distance r = R+h from the center of the 
Earth) are distributed quite evenly on a spherical shell with the area S = 
An r?. The “visible area” is nearly a spherical cap whose area S* is computed 
with Formula (8.26) and { = * 

S* =2nr?(1-sina). 


So, we have an average of 
24. = = 12(1-sina) 


“usable” satellites. The angle a is not the same as the inclination angle y, 
but according to Fig. 8.62, we have 


rsina-R 

tan y = —————— 
r COS@ 

For y = 0, we find a = arcsin = ~ 0.25 (14°). 

This results in an average of eight satellites. 

For exact position determination, one needs 

“only” four satellites. 


Fig. 8.62 satellites 


Just four satellites can be seen if 


2 
a (x 42°) => tany # 0.48 > y x 26°. 
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Such a restriction occurs relatively often on a mountain hike (Fig. 8.60) and 
even more often in a big city (Fig. 8.61)! 

@ Remark: Actual efforts have been made to increase the number of navigation 
satellites to at least 32. Otherwise, the navigation of cars in narrow major city lanes 
becomes a problem! ® <<< 


>>> Application: volume of a beam connection (Fig. 8.64) 
Calculate the volume of the union of two congruent cylinders of revolution 
whose axes intersect at right angles (radius r, height h). 


Fig. 8.63 frustrum of a pyramid Fig. 8.64 beam connection 


Solution: 

It is clear that the volume of the union of the two cylinders is equal to 
the total volume of the two cylindrical beams minus the (doubly counted) 
volume of the average of the two cylinders (this is a fundamental rule of 
Boolean algebra, which we will not discuss in more detail here). We first 
calculate the volume of the average. 

As a reference plane, we select the horizontal plane of symmetry that is 
spanned by the cylinder’s axes. The cross-section of the intersection with a 
plane parallel to it at a distance z provides a square with the area 


A=(20)*=407 (with 9 = Vr? - 22). 


The intersection of the cylinders is circumscribed by a sphere of radius r. 

The circle of the sphere at height z has the area A = 7 0”. The cross-sectional 

surfaces of the sphere and of the intersection solid differ only by the constant 
4 

factor —. The spherical volume is E r?, which is the desired average volume 
T 


Vin V2 = set as 

nT 3 3 
We also get the same value if we assume the average cross-section of the 
object with 2/3 of the maximum cross-section (2r)? as in Application p. 405. 
The squares in the horizontal intersections are always circumscribed by the 
circles of the sphere: 


2 i 
Vin Va = 5-(2r)?-2r= F8, 
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For the third time, we get the same value with Kepler’s rule (p. 413), although 
this would only be an approximation: 


2 1 
Vo" OOH Le. 
6 3 
Finally, the volume of the beam connection is 
16 
Vi U Va = Vi + Va ~ Vin Va = 2 (ar? h) - Sr". da 


>>> Application: weight of an antelope’s horn (Fig. 8.65) 

Antelopes have spirally shaped horns with a characteristic cross-section. The 
circumference u of the cross-section of the base and the height A of the horn 
can be easily estimated. The density of the horn substance is about 2 g/ cm?. 
What is the load that the antelope has to carry? 


oo ee 
Fig. 8.65 kudu, oryx, nyala 


Solution: 

Horns grow according to the so-called “heli-spiral motion”. If one knows a 
cross-section (all other parts are similar), their volume can be equated with 
a cone having the same cross-section according to Cavalieri’s principle. The- 
refore, it is determined by the height, but not the length of the curved center 
line! 

If the circumference wu is given and the cross-section is not extremely elonga- 
ted, we are content with the following approximation: We replace the cross- 
section by a circle of the same circumference and then reduce the area of 
this circle, for instance, by 2/3, as the circle is the closed curve which enclo- 
ses a maximum area for a given circumference. The corresponding volume- 
equivalent cone is, then, a cone of revolution with the desired volume 


2fa\? hk be? 
Vas ( ) To : 
3 \27 3 60 
Numerical example: For u = 20cm and h = 90cm, we find V x 600cm®. Both horns 
together have an estimated weight of more than 2kg. However, parts of the horns 
are hollow. Therefore, we must estimate the actual weight to be lower. 
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@ Remark: As opposed to antler bearers, horn bearers do not discard their “head- 
dress” every year (because they occasionally use it as a dangerous weapon, such as 
the oryx antelopes in the center of Fig. 8.65). Thus, their weight cannot be measu- 
red. The above estimate is, therefore, of practical value. More problematic than the 
weight is the great leverage that occurs with long horns. ® daa 


>>> Application: marking a drinking glass (Fig. 8.46) 
In Application p. 412, we looked at a drinking glass generated by the rotation 
of a sinusoidal curve. Where should the 0.25 /-mark be placed? 


Solution: 
This time, we will use numerical integration. For the volume of the drinking 


glass, we have 
A Qn .\* 
ven f (a-b sin =r) dt 
0 h 


(the integration variable has been relabeled to t). If we consider the upper 
limit of the integral to be the variable x, then we obtain the “volume function” 


x Qn \? 
V(x) = fA _bsi <1) dt 
(=e » (a7 b sins 


dependent on the fluid level x. For x = 12, we obtain 398 cm® from Application 
p. 412. 

Now we look for the solution of the equation V(x) = 250cm°, that is, the 
zeros of the function V(a) - 250 = 0. This is done using Newton’s method 
and leads to x = 8.64cm: The mark must be placed at this height. << 


>>> Application: How many people have ever been born? 

The left of Fig. 8.67 shows the development of the Earth’s total population. 
You do not need to be a mathematician to recognize the explosiveness of the 
curve drawn in red. Here, we will only raise two unorthodox questions: How 
many people have ever been born and how many of them are still alive? 
Solution: 

The area underneath the curve can be interpreted as a measure of the number 
of people that have been born. The curve does not, of course, start at the year 
0 but goes back a long time. It is estimated that about 5 million people lived 
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population in billions [population in billions 
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Fig. 8.67 A steep upward movement... Fig. 8.68... with a foreseeable end? 


10,000 years ago. Let us approximate the red curve in the left-hand region 
by a straight line from (—8,000/0.005) to (0/0.3). The contribution beyond 
this period can be neglected. A numerical integration shows that the number 
of people born in the last 2000 years is equal to the number of people born 
before then (in 2045, the ratio will be clearly in favour of the right half). 
Now we need the average age of a modern person. It is under the age of 
30 (in many developing and emerging countries, it is considerably lower; in 
Europe and North America, it is much higher). If y is the current year, then 
the fraction p is the quotient of two definite integrals: 


p= f° fa)de] f° f(2)ae. 


For y = 2006 specifically, we have p = 0.06. That is, 6% of all people ever born 
are alive. In the year y = 2045, it will be 9%. 


® Remark: Fig. 8.68 shows a detail of the red curve over a period of 100 years. 
Currently, the world’s population is growing annually by about the population of 
Germany. The right part of the curve is, of course, an extrapolation! If this proves 
true, a stabilization of the world population at a high level is in sight at the end of 
the century. This calculation also shows how the population figures will develop on 
the individual continents. Obviously, everything depends on Asia. ® <<< 


>>> Application: differential equation of the catenary (Fig. 8.69) 
Determine the curve that is the equilibrium position of a “heavy rope” (or, 
for example, of a chain). 

Solution: 

In order to calculate the equation of the curve, consider a curve point P(a/y) 
and an infinitesimally close point Q(a + dx/y + dy) (Fig. 8.69). The arc ele- 


ment PQ has the length 
ds = \/ dx? + dy?. 


The weight Fi of the rope piece from P to Q is proportional to the length ds 
and acts in the y-direction. 

—>> 
The tangent @ has the direction vector tg. Those in P have the direction 


vector tp. The two tractive forces at the rope points P and Q act in the 
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Fig. 8.69 a differential equation 


direction of the curve tangent. Thus, they are multiples of the tangent vectors. 
The weight and tractive forces must be balanced so that 


A: f =B-tg+C- tp. 


The vector equation is written down in detail: 


A gate) Alyas}C ae) 


When comparing the first row, we see that B = C’. We divide the second row 


by B and get 
A 
pve + dy? = y'(x+dz) -y'(z). 


If we now set A/B =a and divide the equation by dz, we get 


av da? + dy? y'(a+ daz) -y'(a) 
dx 7 ; 


dx 


Now, we perform the limit Q > P: dx > 0, which causes y/(x + dx) > y”: 


aVl+y2=y". (8.32) 


Obviously, this differential equation is of the second order (the second de- 
rivative occurs). Since y does not appear, the differential equation can be 
reduced to a differential equation of the first order with the substitution 


2(a) = y"(a) > 2"(a) = y"(a): 


aV1+22 = 2! > 2% =a7(14 27). 


This is usually solved by separating variables: z = we? *+vel® => 2! = pueP*+ 
qv e%* is then inserted into Formula (8.32) and coefficients are compared. One 
can also guess the solution: 


z=sinh— = ——"—. (8.33) 
a 
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This results in 


y= a a(e)de= acosh— +C. 
a 


A prototype of this family of curves is 


y = acosh = (8.34) 


The vertex S(0/a) of this catenary is, then, on the y-axis. For a general 
vertex S(29/Yyo), the equation is 


«&— XO 


y = acosh + (yo —a). (8.35) 


a 


All catenaries are similar to each other (up to a scaling factor a) — as are all 
circles, all parabolas, and all equilateral hyperbolas (Application p. 242). <««« 


>>> Application: arc length of the catenary 
Compute the arc length of the catenary in the interval x < 7 < x9. 


Solution: 


ns *s/1+y?de 


1 


with y' = sinh 8 > V/1+ y’ = cosh —*. This integral can be expressed in 
terms of elementary functions: 


L=a (sinn eo aah 2) (8.36) 
a a 


For x9 = 0, x1 = 0, and x2 = 6 specifically, we have L = asinh . <<< 
a 


The rope pulls at the suspension point B with a force F that acts in the 
opposite direction of the tangent vector of the curve and whose magnitude 
eo : ae ; z_ b 
is, therefore, proportional to its length V1+ y = acosh =. 


>>> Application: chain of prescribed length on two points (Fig. 8.70) 
This task, which frequently occurs in practice, also leads to a function f(a) = 
0, which can only be solved by means of an approximation. 

Let P,(21/y1) and P:(x2/y2) be the two end points and let / be the given 
length of the chain. Then, we have 


f(a) = 2asinh 2@—* — JP = (yp —y1)?. 


2a 


This leads to _ 
p= %1+22- 2a artanh 
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with artanhz = $In[(1+2)/(1-<)] and, subsequently, 


b= y, +a-acosh 22. 
a 
Numerical example: 
P,(2/3), P2(8/5), l= 8 (> a1 = 2, y = 3, v2 = 8, yo = 5) > Xo = 2.364, yo = 
4.396, a= 1.678. << 


B (Np) 


Fig. 8.70 chain of a given length Fig. 8.71 catenoid as a minimal surface 


>>» Application: the “catenoid” as a minimal surface (Fig. 8.71) 

If a wire rack composed of two parallel circles is immersed in a soap solution, 
the resulting soap film has the form of a catenoid. This surface is generated 
by the rotation of the catenary around the z-axis. Now, replace the meridian 
of the catenoid by a parabolic arc that has the same vertex and the same 
end points. This results in a new surface of revolution that is hardly distin- 
guishable from the catenoid. Show by means of a numerical example that the 
catenoid has the smaller surface area. 


Solution: 

For the sake of simplicity, we choose the “standard catenary” y = coshz. The 
boundary circles are given by 2, = —1 and x2 = 1. Then, the surface area of 
the corresponding catenoid (the darker part in Fig. 8.71) equals 


1 1 


S =20 [ coshz V 14+ sinh? x dx = 27 [ cosh? x dx = 17.677. 


-1 -1 


The approximation with the parabola has the equation 
y=(coshl-1)27+1 => y’ =2(coshl-1)z. 


The corresponding surface area (numerically evaluated) is 17.680, and thus, 
only 0.05 per mille larger. 

® Remark: The surface area of the cylinder of revolution through the two congru- 
ent boundary circles of the catenoid is larger than the surface area of the catenoid 
in-between these two circles provided that the circles are not too large. Fig. 8.71 
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illustrates that the cylindrical surface area can, however, be smaller than the surface 
area of the catenoid for farther away boundary circles. The “minimal surface conditi- 
on” does not always provide the absolute minimum (unstable form). In the appendix 
on complex numbers, we will discuss the catenoid again (Application p. 548). ® 


ddd 


Fig. 8.72 Herrenkrug bridge in Magdeburg: The ropes are parabolas in non-vertical 
planes. The “proof”: In the photograph, the images of the ropes are conics. 


>>» Application: What is the shape of the ropes? (Fig. 8.72) 

Free hanging ropes have the shape of a catenary in a vertical plane. If the 
ropes are strained equally, their shapes converge to parabolas. How can one 
verify this on a photographic image? 

Solution: 

A parabola is a conic section. We know that perspective images (such as 
photographs) of conics are again conics, though the type of the conic may 
change. A parabola only appears as parabola under specific conditions, which 
are usually not fulfilled in an arbitrarily taken photo. So, in most cases, the 
image will be either an ellipse or a hyperbola. On the other hand, a catenary 
will never appear as a conic. 

Conics are defined uniquely by five points. So, if we choose five image points 
on the rope and there is a conic that passes through them, then the corre- 
sponding space curve is very likely to be a conic. If we are sure that the curve 
is planar, it must be a conic. Fig. 8.72 shows that for this bridge, the ropes 
are conics — and certainly not catenaries. It is possible to determine whether 
they are parabolas or not, but this would go too far here. 
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The “proof” is, of course, only heuristic and not strictly mathematical, but at 
least it provides “strong evidence”. If the result is confirmed in one, or better, 
two additional pictures from other viewpoints, then we can be sure, and this 
could be accepted as a proof even by a mathematician. 

® Remark: If a rope is not strained equally, however, the result can be “basically 
everything”. ® <<< 


>>> Application: differential equation for free fall with air resistance 
Find the falling speed as a function of time. 


Solution: 

In Application p. 21, we had Newton’s formula for the velocity in free fall, 
2 

and with it, the equation ma=mg-cw Ao*,. If we divide by m, we get 


ge setae with b= 


cw Ao 
~ dt , 


2m 
We can separate the variables ¢ and v and get 


1 


a 
g—bv? 


dv. 


Now, we integrate both sides: 
1 a 1 1 1 
t= i dv = [ du = ‘ dv. 
g—bv2 g 1- by? g 7 


With the substitution 


oN ey 
g g b 
we obtain 
1 1 1 : 
t=,/— [ ——,; dx =,/—artanh x according to Formula (6.22). 
bgJ 1-2? bg 


If we apply the hyperbolic tangent on both sides, we obtain x (and hence v) 
as a function of t: 


r=y/bo=tannyoot => v= [4 tanh you~, 
g 


For t > oo, the hyperbolic tangent converges to 1 and the velocity converges 
towards the terminal velocity Umax = \/g/b (Application p. 21). 

® Remark: Of course, the dependence of the fallen distance on time is also interes- 
ting in this context. We have to integrate this again. This results (without further 
explanation) in 


s(t) =1/b In(cosh /bgt). 
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Those who are not convinced only need to differentiate s(t) with respect to ¢ using 
the chain rule which, as is well-known, is always simpler (one can still say: one has 
guessed the solution ...). In practice, we consult collections of formulas to check 
if one can find the desired integral or we work with algebra systems like Derive, 
Mathematica, Maple, etc. @ <<< 


>>> Application: vector fields and flux lines 

If there is a tangent direction y'(x,y) assigned to each point P(a/y) in 
a certain domain, then there are curves that “integrate” these directions. 
Determine the integral curves for y’ = —4 (Fig. 8.73, left). 


Solution: 
With y/' = a we have to solve the differential equation oo = —4. In this case, 
the variables x and y can easily be separated and dy — After integrating, 
the name of the variable does not matter. We can integrate this equation 
with Formula (8.10) and get In|y| = —In|z|+C or In|y|+In|2| = C (both sides 
provide an integration constant, which we write on the right-hand side of the 
equation as C’). Using the computation rules for logarithms, we have In| y| = 
C => ry = +e© = +a. These are the equations of all equilateral hyperbolas 
with the axis of the coordinate system for their asymptotes (Application 
p. 242). 
® Remark: On the right of Fig. 8.73, iron chips under the influence of a magnetic 
field are shown. However, the corresponding differential equation is not so easy to 
solve. Flux in liquids and in the atmosphere can also be described by vector fields. @ 
ddd 


9 Statistics and probability calculus 


The word statistics was first employed as a synonym for “numerical descrip- 
tions of societies” in 19th century England and France. In those times, no 
inferences had yet been made about the properties of individual people. Only 
in the late 19th century did statistics experience a boom, which coincided 
with the rising use of mathematical methods in the natural sciences. 

Today, statistics does not merely refer to population statistics, but to a 
branch of mathematics that investigates mass phenomena and attempts to 


develop decision-supporting methods. One may even put it more poetically: 
Statistics is the art of learning from numbers. 

In descriptive statistics, data is sensibly ordered, grouped, and summarized 
through graphics, tables, or measures. This is usually done in an immediately 
accessible way, as it is aimed at drawing attention to the essence of the data 
set (a data sample). 

In order to draw correct farther-reaching generalizations from a data sample, 
we need to employ the fundamentals of probability calculus. This will, in 
turn, enable us to learn about probability distributions, and in particular, 
about the “central limit theorem”, which allows us to make predictions and 
draw conclusions from statistical materials. 
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9.1 Descriptive statistics 


Bar charts, pie charts, histograms 


A bar chart or histogram consists of a set of columns or bars, whose heights 
and surface areas are proportional to measured or calculated values. 

>>> Application: number of computers worldwide 

The table to the left shows statistics 
about the worldwide numbers of com- 
1975 50 puters in the years 1975-2010.% What 


year number (in thousands) 


aoe . aap is particularly noticeable is that the in- 
1990 100. 000 terval differences are not always uni- 
1995 295, 000 form (2003 is inserted). Since the ta- 
2000 523, 000 ble was compiled in 2006, the value for 
2003 738, 000 2010 is given as a prediction. 

2005 896, 000 

2010 1.350.000 “Data from www.etforecasts.com/products/ 


ES_peww1203.htm, January 2006. 


A 


amount in millions 


Xe) fo) wD lo) wo So Mw oO 
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F a, 54 4,35 aR A 
Fig. 9.1 growth in the number of computers 1975-2010 


In Fig. 9.1, the temporal range is notated on the axis of abscissas and the 
associated number of computers on the axis of ordinates. Two different pre- 
dictions for the year 2010 can also be seen. At any given time, the associated 
number of computers can be estimated by interpolation. The measured data 
is interpolated by a curve which is as smooth as possible. 

Fig. 9.1 indicates that there is a function of which only a certain number 
of sampling points is given. Through a smooth connection of the sampling 
points (spline curve), the associated function graph can be drawn (black). 
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Possible extrapolations towards the right (predictions) are shown in black 
and grey. The inserted intermediate value from 2003 is supposed to indicate 
the quality of the extrapolation. 

® Remark: This application stems from an earlier edition of the book and is, thus, 
already “dated”. However, such examples may also have their advantages: It is pos- 
sible to ascertain whether trends continued or predictions have actually come true. 
In fact, the web address given in the footnote still existed at the beginning of 2014, 
and we may now give the following real-world comparisons: The predicted value 
of 1.35 billion computers may be contrasted against the actual estimated value of 
1.425 billion. For the years 2015 and 2020, numbers of the order of 2.2 billion and 
2.5 billion are predicted. ® <a 


Bar charts are particularly well suited for juxtaposing two or three compa- 
rable data sets. For example, related bars may be grouped together using 
different colors. Another way of visualizing such data sets is provided by pie 
charts, which are practical especially when dealing with percentages. 


>>> Application: population profile 


erro. ey aay The population profile for Germany from 
0-19 30.2% 20.3% 1950 to 2007 is to be analysed and visuali- 
20-39 = 25.3% 26.5% ved in a bar chart. The given census-based 


40-59 28.97% — 28.3% population statistics are to be used for this 
60 — 79 14.4% 20.6% 


80 — 99 1.2% 4.3% 


purpose. 


Solution: 

Here, we are dealing with comparatively few data (or rather: with strongly 
compressed data). Common sense provided, the results are accessible even to 
non-experts. One needs only the “naked eye” to conclude that the proporti- 
on of young people has decreased, while the proportion of older people has 


increased. 
A fs 1950 
30% By 2007 


1950 2007 
20% 
10% 


0-19 20-39 40-59 60-79 =. 99 fee 


Fig. 9.2 bar chart (histogram) ... Fig. 9.3... and pie chart 


Concerning the bar chart in Fig. 9.2: On the axis of abscissas, we inscribe 
appropriate intervals for the five age groups. Above these, we place rectangles 
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of appropriate width whose height is proportional to the respective percen- 
tages. Since there are only two data columns in the table, we merely need 
to draw two rectangles next to each other, using different colors. The chart 
shows nicely how what was formerly the young section of the population now 
constitutes today’s age group of 60 -— 79. 


® Remark: Such graphical representations are easier to understand for many peo- 
ple than decimal numbers. The ratio of the individual percentages per age group 
becomes particularly clear. This shows that the decline in the youngest age group 
is lower than the increase in the elderly or the oldest. One can see nicely that the 
high percentage of the age group 0-19 almost 60 years ago is reflected today in the 
high percentage of the age group 60-79. ® 


Ad the pie chart (Fig. 9.3): The sum of the percentages is always constant, 
namely 100%. A sector with an opening angle of 360°/100 = 3.6° corresponds 
to one percentage point. 

This time, we separate the two columns of the table. The color-coding refers 
to age groups. It is up to us where we put the starting point on the circle. 
If we place it in such a way that the age group 20 - 39 starts at 12 o’clock 
(thinking of an analog clock), one can already see the following relatively 
easily: If the 20 to 60-year-olds are essentially the “net payers”, then they 
form the scarce majority of the population in the two different distributions. 


® Remark: Therefore, the education system and retirement system (still) works. The 
idea to raise the retirement age to compensate for the casualties of the net payers 
which arise due the “natural course of things” is, thus, an open secret. ® 
Subsequently: In December 2010, the wheel had proceeded in such a way 
that the age groups 0-19, 20-39, etc. were given the following percentages: 
18.4%, 24.2%, 31.1%, 21.0%, 5.3%. 

We already see: Unless we make a statistic about the curvature of bananas or 
the number of brush strokes in paintings, we will always try to use statistics 
to arrive at results which do not appear to be listed. What is important is 
that we have to deal with “honest” data. Fictitious statistics are not even 
half as exciting! aac 


Illustrate connections 


Statistics should not only be done for their own purpose. The information in 
the data is often quite obvious. 


>>> Application: language recognition by means of letter frequency 
How often do individual letters appear in a text? Even if we restrict ourselves 
to a single language, one can only work with samples. The more text is tested, 
the better our understanding becomes. The question is: Are there laws in the 
distribution of letters which are characteristic to a language? 

For the present case, we have initially selected a text by Franz Kafka that 
was found on the Internet and has n = 150,000 characters. This amounts to 
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a 43-page manuscript with 3,500 characters per page. We assign the different 
types of umlaut, ‘a’, ‘6’, ‘ti’, which represent only a small percentage of the 
characters, to the letters ‘a’, ‘o’, ‘u’, and the more frequent letter ‘&’ to ‘s’ 
(counting it as one letter). If we do not distinguish upper and lower case 
letters, then we have 26 different classes. 

If all letters appeared with the same frequency, we could expect a (100/26)% = 
3.85% share for each letter. Fig. 9.4 (vowels drawn in orange) tells us at a 
glance: The letter ‘e’ is the undisputed leader, followed by the consonant ‘n’. 
The letters ‘i’ and ‘r’ show up nearly equally often. 


10% 10% 
SS Se 
> p> 
oO7 | o- 
=| wT S 
oO al o 
5 | 5 
jou iow 
oO o 


| AVES TBU 
pqrstuvwxyz abcdef ghijklmnopqrstuvwxyz 


Fig. 9.4 Franz Kafka vs. Mathematikbuch Fig. 9.5 Italian vs. Croatian 


abcdef ghijklmno 


® Remark: It is interesting to see if the proportion of letters changes over time or 
from application to application. In Fig. 9.4, the “classical” distribution of the Mathe- 
matical Toolbox (right, brighter bars) is compared with the new spelling (again with 
n = 150,000). The distribution of letters qualitatively resembles that of the text by 
Franz Kafka. The amounts of ‘e’ and ‘h’ is slightly lower. We should bear in mind 
though that ‘x’ and ‘y’ are letters that typically appear quite often in mathematical 
texts. ® 


som HT) il: eral 


Fig. 9.6 different languages 


Let us take a closer look at an Italian and a Croatian text (Fig. 9.5). This 
results in a completely different picture: In these languages, the first four 
places are occupied by the vowels ‘a’, ‘e’, ‘i’, and ‘o’. In Croatian, ‘j’ and ‘z’ 


442 Chapter 9: Statistics and probability calculus 


occur strikingly often, while ‘f’ is very rare. The letters ‘q’ and ‘w’ do not 
occur at all. Nevertheless, the two languages differ considerably less than, for 
example, German and Croatian. <<a 


Similar comparisons can be made in different language texts, with relatively 
small differences between individual texts. Like German, English has the 
letter ‘e’ ranked first in the list, but ‘e’ is then followed by ‘t’ and ‘o’. The 
vowel ‘a’, on the other hand, is as frequent as ‘i’. Even in non-mathematical 
English, ‘x’ and ‘y’ are comparatively common. 


® Remark: This frequency is taken into account in the Morse code: Frequent letters 
— especially ‘e’ — have a short Morse character. Morse counted the printing press 
type-case in Philadelphia, where he found ‘e’ 12,000 times, ‘t’ 9,000 times, and ‘a’, 


Regt SSP Ban 
? 


o’, ‘i’, ‘n’, and ‘s’ 8,000 times. This is consistent with our results. ® 


On the basis of such diagrams, it is obvious to conjecture that the language 
in which a text is written can already be recognized by a frequency diagram. 
In the same way, one’s voice can be recognized by a frequency diagram. 


Data collection and grouping 


Mathematical statistics is used in a wide variety of fields: the statistic of 
bicycle burglaries for the insurance industry, formation of low pressure areas 
in meteorology, the relation between the population of a beetle species and 
the number of soil bacteria in biology, or the analysis of the efficacy of certain 
medicines. 


Despite the diversity and variability of the questions one can ask, one should 
proceed in mathematical statistics according to the following basic steps: 
First, the problem has to be formulated. In order to do this, clear concepts 
need to be created. If one is looking for the percentage of alcoholics in a 
country, then first clarify what makes someone an alcoholic. What if one 
wants to count the number of traffic fatalities on roads? Does a person who 
has been in critical condition for over 30 hours then fall out of the statistics? 
(This is how it is currently being handled.) 

Secondly, the experiment has to be planned. Specifically, one has to ask oneself 
how much data will be needed to draw any conclusions from it at all and by 
which criteria the data is to be selected. Only then can the practical execution 
follow. 

Next, the results need to be tabulated and described (creation of graphics). 
Not infrequently, data must also be grouped in a meaningful way. For better 
comprehension, it is important to determine certain measures of the statistics. 
The solution to this problem is presented in this section. 

The last step is probably the most crucial and also the most difficult one: the 
conclusion from the sample to the entirety. It can only work if the previous 
steps are well thought-out and correctly executed. The mathematical solution 
is developed in the following sections. 
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The last step is omitted when all data is available. A typical example is 
population statistics. 


>>> Application: age pyramid 

If the age distribution of the population is examined on the basis of a census, 
then for example, everyone born in the same calendar year can form a class. 
You can also distinguish between males and females. Fig. 9.7 presents the two 
histograms by plotting the relative frequency of a certain age group upwards 
or downwards depending on the sex. 


surplus of women 


male 


female 


surplus of men 
Fig. 9.7 age distribution of the Austrian population 2005, broken down by sex 
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Fig. 9.8 the development of the age distribution: Strong birth years “move to the 
right” (red curve: 5 years later). 


Two different colors are used. To put it more precisely, both histograms are 
plotted on both sides, but in reverse order. This illustrates the difference 
between the two frequencies. In the present example, it is seen that the male 
proportion is greater up to the age of about 60 years. However, then the 
situation turns around, and the 80-year-olds and older clearly have a female 
surplus. 
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® Remark: Such statistics are extremely important to the expert. They allow us 
to make fairly accurate predictions of how the age distribution will develop in the 
following years. Fig. 9.8 shows how the age groups move “to the right” over the 
years. Therefore, one can predict relatively early, for example, when there will be a 
surplus or a lack of teachers for certain grades. ® <<< 


Mean, median, and other measures 


One can link the data of a sample to a number of measures in order to assess 
the data better. 


The arithmetic mean & of a measurement series 7;, i= 1,...,7 is: 
Gyt+egt...+2 iy 
j : = Leet Ln 
arithmetic mean: % = =-—- > Dis 
Ww n j=l 


Further, there is certainly at least one minimum value or one maximum value 
respectively, and the difference between the two values is called the span. 

If you arrange values according to size, you can look for a value which is 
smaller than or equal to at least half of the values, as well as larger than or 
equal to the other half. This value is called the median. 

If the amount of data is even, then there are two values which satisfy this pro- 
perty, so we define the median to be their arithmetic mean. Mathematically, 
we can formulate this roughly as follows: For a random (one-dimensional) 
sample 2; (i= 1,...,n), the median is the value x(,,,1)/2 if n is odd and it is 
the value 1/2 + (2p/2 + 2n/o41) if n is even. 

>>> Application: monthly wages 

We consider the monthly wages of a fictitious department supervisor of a 
company (4,000 €) and three other employees (2,000 € each). The amount 
of data ordered by size is 


{2,000, 2,000, 2,000, 4,000}. 


The minimum value is 2,000. The maximum value is 4,000. The mean is 
+(3 -2,000+1-4,000) = 2,500. The median is the arithmetic mean of the 
second and third value, namely 2,000. 

In this case, the median is already much lower than the arithmetic mean. 
Even more extreme is the difference when one looks at the fees of professional 
athletes. Here, the median reflects the reality much better than the mean. 
In an oil sheikdom, of course, most people have a comparatively lower — and 
to some extent, equal — income (the median income), while a few individuals 
move millions on their accounts. <<< 


The arithmetic mean and the median are different in general, and neither of 
the two values is the arithmetic mean of the minimum and maximum value. 
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In contrast to the mean, the median is, at least for an odd n, a value occurring 
in the sample. On the other hand, we are accustomed to accept that, for 
example, a Central European owns 0.8 cars on average (the median would 
probably be 1). 

>>> Application: The majority is often better than the average. 

If 90% of all drivers are accident-free within one calendar year and the re- 
maining 10% have exactly one accident, each driver has an average of 0.1 
accidents during this period. The median, on the other hand, is certainly 0, 
which insurance companies use to their advantage. <<< 


op... 


Fig. 9.9 different mean values 


>>> Application: The arithmetic mean does not always make sense. 
Consider the largest and smallest land mammal, that is, an elephant (mass 
4,000 kg, body length 4 m) and a shrew (mass 4 g= 4/1,000 kg, body length 
4 cm). The arithmetic mean provides an animal with a mass of 2,000 kg and 
a body length of 2 m. This is most likely to correspond to the mass and body 
length of a hippo. 

The geometric mean % =,/%1 - £2, on the other hand, yields a living creature 
with a length of 40 cm and a mass of 4 kg, which comes close to a domestic cat. 
Among the 5,000 known mammals, there is practically no species between 
the hippo and the elephant, while there are certainly hundreds of species 
which are bigger or heavier than a cat. <<< 


A variable which is important for a probability distribution (see p. 487) is 
the variance. This is the mean square deviation from the mean value @. It 
indicates to what extent the data differs from %. The less deviation there is, 
the more meaning the data has. Variance and standard deviation are defined 


by: 


; 1 < = ae 
variance 0” = Soa (2; -%)?, standard deviation ¢ = Vo?. 
Th 5 
i=1 
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9.2 Probability — computing with chance 


The theory of calculating probability first developed from questions of gamb- 
ling. There are exact probabilities under ideal conditions. However, mathe- 
matical statistics must additionally provide methods which allow relations to 
be identified under non-ideal conditions (as they usually occur in practice) 
with sufficient certainty so that reasonable forecasts can be made. 

The term “probably” is already established in colloquial language. It will 
probably rain tomorrow. A certain politician will probably win the election. 
Your partner will probably be worried if you are two hours late and you 
cannot be reached on the mobile phone. All of your bones will probably 
break if you jump down to the street from a height of 10 meters, and it 
would be a miracle if your bones remained unharmed. The chance of being 
hit by lightning is higher than that of your boss admitting to being wrong. 
This is not the mathematical concept of probability, which is not necessarily 
contradictory to subjective perception. In the weather forecast, we are told 
that there will be a 60% chance of rain tomorrow. The survey results yield 
a 5% lead of candidate A to candidate B. The probability of being hit by 
lightning is higher than that of a plane crash. 
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Fig. 9.10 Left: How often does it happen that you have to pay a round amount to 


the cashier and do not have to hassle with change in your hand? Right: The mileage 
222,222 in your own car is already something special. 


>>> Application: the little joys of everyday life (Fig. 9.10) 

If you pay for more than 15 or 20 items at the cashier, you will rarely get a 
round Euro amount (as in the image on the left). Any Cent-sum can occur, 
so that there is a 1: 100 probability of a round Euro sum. However, if all 
items cost 0.99€, 1.99€, etc., you will need exactly 100 items. 

When driving in your car and looking at the kilometer distance on your trip 
recorder, the probability of spotting 6 identical digits by accident (Fig. 9.10, 
right) is nearly zero. Even if you hope to spot the number 222, 222, there is 
only a moderate chance of actually seeing it in your own car. 
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® Remark: According to an estimate from the USA,! a car drives an average of 
204,000 km. Still, 10% of American cars reach 200, 000 miles (* 322,000 km) and are, 
therefore, potential candidates for the 333, 333 km threshold. Reaching 111,111 km, 
on the other hand, is very likely. @ <<< 


>>> Application: Friday the Thirteenth 

Some people see it as a lucky day. Others dread it as a black day. Many 
believe that this constellation of calender day and weekday is quite unlikely 
to occur. However, the thirteenth can fall on any of the seven days of the 
week, and it occurs as often as the twelfth or fourteenth day of a month. On 
average, every seventh thirteenth will fall on a Friday, and we can experience 
the event about twice a year. 


® Remark: A more detailed study of how the thirteenth of each month is distributed 
among the seven days of the week (in this case, it is of no relevance when we 
start our calculation) will yield the ratio of 687:685:685:687:684:688:684 for the 
occurrence frequencies among weekdays beginning with Sunday. The ratio shows 
that the thirteenth falls more frequently on a Friday (printed here in bold) than on 
any other day of the week (though the difference is barely significant).? ® <<< 


Fig. 9.11 The probability of being killed by a shark — here a white shark (Car- 
charhinus carcharias) — is almost zero. People are actually the real monsters who 
slaughter millions of these animals only for their fins. 


>>> Application: the fear of a shark attack 
Every year around 20-30 people are attacked by sharks worldwide. An ave- 


1 Vehicle Survivability and Travel Mileage Schedules. National Center for Statistics and Analysis 
(1/2006). 
*https://en.wikipedia. org/wiki/Friday_the_13th 
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rage of 6 people succumb to their injuries. As a result, billions of people are 
afraid of swimming further out into the sea. 

If we make a deliberately low estimate that perhaps every thousandth person 
has at least once the theoretical chance of splashing near one of the dreaded 
monsters during a beach holiday, then we have 6 actual and 7 million possible 
cases, i.e. a ratio of less than 1: 1,000,000 for this nightmare to become true. 
Interestingly enough, some people are comforted by the fact that about three- 
quarters of the attacked people survive, although this hardly changes any- 
thing about the order of the magnitude in question. 

For comparison: From a total of about 100 million people in the German- 
speaking world, about 7,000 die in road accidents per year. Here the proba- 
bility is 70 times as high (70: 1,000,000). 

In return, unscrupulous humans slaughter 100 to 200 million sharks per year, 
often for the mere purpose of cutting off their the coveted fins (the meat. is 
less tasty). The probability of a shark being killed by human hands is so high 
that these animals are almost extinct. <<< 


Where do we deal with probabilities? 


It is comparatively easy to find examples of applications in everyday life 
involving probability calculations, simply because so much in life and nature 
has to do with chance. Here, we will discuss two examples that, at first glance, 
may not appear to be related to statistics or probability: 


>>> Application: goals in a soccer match 

In soccer — compared to other ball games — few goals are scored. Is this good 
or bad for a weaker team? 

Let us assume that in the last ten matches, team A scored half the goals of 
team B. 

If A scores a maximum of one goal during the match in question and B scores 
a maximum of two, then the six possible game outcomes are 0:0, 0:1, 0: 2, 
1:0, 1:1, 1:2. Thus, A loses in three cases and wins in one case. In the 
remaining two cases, the game is drawn. Therefore, in half of the cases, A 
makes at least one point (50: 50). 

If A scores a maximum of two goals and B scores a maximum of four, then 
the outcomes 0:1, 0:2, 1:2,0:3,1:3,2:3,0:4,1:4, and 2:4 area 
loss for A. The draws 0:0, 1:1, and 2:2 are at least a partial success. The 
results 1:0, 2:0, and 2:1 are a victory. This time, A reaches only one point 
in 6 out of 15 possible cases (40 : 60). 

This shows that if fewer goals are scored in a game, the weaker team is more 
likely to score a point. This is due to the fact that the probability of a draw 
is higher. <<< 


>>> Application: bubble formation in a water pot (Fig. 9.13) 
When we boil water in a cooking pot, small gas bubbles develop randomly 
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Fig. 9.12 few goals are thrilling ... Fig. 9.13 gas bubbles 


on the ground. As the temperature increases, bubbles start rising, and the 
physicist speaks of Maxwell’s velocity distribution — which we shall return 
to in the next chapter — and new and larger bubbles are formed gradually 
bringing the water to rest. 

If the pot is centered on the hearth, the water will rise evenly despite the 
constantly new situation of the fire. In sum, the probabilities are the same, 
and the place where the bubbles are formed no longer plays a role. <<< 


Profit outlook as motivation 


The beginnings of probability theory go back to Galileo Galilei, Blaise Pas- 
cal, and Pierre de Fermat in the 17th century. It began with the profit 
outlook for games of chance. At the beginning of the 18th century, two basic 
works on probability theory appeared: one written by Jacob Bernoulli (Ars 
conjectandi) and another by Abraham de Moivre (The Doctrine of Chances). 
Pierre Simon de Laplace refined the theories in his Théorie analytique des 
probabilités a hundred years later. Thus, the classic concept of probability 
was coined. This concept is still used today in game theory. Later, the term 
had to be expanded to be suitable for statistics. 


Classical experiments 


ae. £ ck | 
Fig. 9.14 The toss of a coin is a classical experiment with only two possible results 
(except when the coin lands on its edge, which is quite unlikely). Throwing a drawing 
pin is a random experiment, but not a Laplace experiment, because the probability 
of landing on the head is greater than that of balancing on the tip. 


450 Chapter 9: Statistics and probability calculus 


e If we throw a die, we can expect a number from one to six. Nobody can 
predict the exact outcome. 


e A coin tossed high is, so to speak, a die with only two sides (“heads or 
tails”). 


e Also, when you draw a card in a card game, you have a well-defined 
number of options. Yet, if the cards are well shuffled, the result is not 
predictable for each attempt. 


e This analogously applies to a blind person grasping for something in an 
urn in which there are balls that only differ by color. 


Random experiment 

What we mean by a random experiment (experiment, for short) is an arbitra- 
ry, often repeatable process which is carried out according to a very specific 
set of rules, the result of which cannot be clearly predicted. In other words, 
the random experiment’s result “depends on chance”. 


>>> Application: Is there a random experiment? 


e Tossing a drawing pin is a random experiment with two possible results: 
the tip facing up or down. 


e The case of buttered bread falling from a table to the ground is, strictly 
speaking, not sufficiently defined to qualify as a random experiment: The 
result depends strongly on the height of the table: The bread does not fall 
on the buttered side because it is heavier. Due to the usual table height, 
it is, in fact, likely to perform a half-turn. 


e Choosing a person for the purpose of investigating various data (e.g. cho- 
lesterol levels or telephone accounts) or checking the expiration date of 
a product in a supermarket can be a random experiment if the selection 
criteria is predetermined. 


The result of the experiment 


An experiment has different results by definition. For each attempt, a given 
result (event) A may or may not come true. If we perform the experiment 
n times and A occurs k times, then k is the absolute probability of A. The 
relative frequency h(A) is then defined by 


number of trials with result A 


h(A) = (> 0<h(A) <1). 


number of trials 
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>>> Application: boy or girl (1) (Fig. 9.15) 

In one day, 14 boys and 16 girls are born in a birth clinic. The two possible 
events are A: birth of a boy, B: birth of a girl. The corresponding relative 
frequencies are 

h(A) = 14/30 » 0.47, h(B) = 16/30 » 0.53. Countless statistics show that the 
ratio of boys : girls at birth is not exactly 1: 1, but that more boys are born 
on average around the world. See also Application p. 493. 


Fig. 9.15 Three to four babies every second worldwide ... 


Smaller differences between individual countries are generally related to mar- 
riage habits, which tend to have impact on the age at which women have ba- 
bies: Older mothers have a slightly higher probability of having a boy. «<< 


Can events be related to each other? 
We shall define the following: 


Sum event and product event Considering the event in which either A or B 
occurs, one speaks of the sum event AU B. The event which occurs when both 
A and B occur is called a product event An B. 


® Remark: In different textbooks, there are alternative notations for the sum event 
and the product event: Instead of AU B, one also writes A+ B or Av B. Instead 
of An B, one also writes AB or AA B. From the context, it is usually easy to see 
which notation is preferred, and because there is no possibility of confusion, this is 
not a problem for us. ® 


>>> Application: events when rolling dice 

A: rolling a 6, B: rolling a 3, C: rolling an even number. Then, by definition, 
AUB: rolling a number divisible by 3, AUC: rolling an even number, BUC: 
rolling an even number or a 3 (or alternatively phrased, rolling a number 
other than 1 and 5). Furthermore, An B: is not possible, An C: rolling a 6, 
BoC: is not possible. From the products one sees immediately: The events 
A and B on the one hand and B and C on the other hand exclude each other 
mutually. <<< 
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>>> Application: ticking-off 

A questionnaire for an examination according to the “American system” is 
considerably easier to fill out if only one answer is possible, that is, if the 
answers are mutually exclusive. A lot more knowledge is required to tick the 
correct answer(s) in a multiple-choice test. ac 


For the relative frequencies of two events A and B, we have: 


A and B exclude each other: h( AUB) =h(A)+h(B), h(An B) =0; 
in general: h(An B)=h(A)+h(B)-h(AUB). 


Proof: As in the above two formulas, let A and B be two events that can occur in 
an experiment. In n trials, exactly four cases can occur in each experiment: 


1. A and B occur simultaneously (7 times); 
2. A occurs but not B (j times); 
3. B occurs but not A (k times); 


4. Neither A nor B occur (n-i- j—k times). 


Thus, A has occurred exactly 7+ 7 times, B occurred exactly i+ k times, AU B 
occurred exactly i+ j7+k times and An B occurs exactly i times. This, by definition, 
gives the relative probabilities 

i+gt+k 


iaaBy= © © 


nn? n 


a+4 itk 
h(A) = —*, h(B) = —, h(AvB) = 


>>> Application: Fibonacci number and/or prime number 

Consider the integers n up to 20 and the occurrences 

A: nis a Fibonacci number (p. 530) (a € {2,3,5,8, 13}); 

B: n is a prime number (ne {2,3,5,7, 11, 13,17, 19}). 

We have h(A) = 5/20, h(B) = 8/20, h(An B) = 4/20, h(Au B) = 9/20. In 
fact, the second of the above formulas holds true. <<< 


>>» Application: boy or girl (2) 

In Application p. 450 with 30 births (14 boys, 16 girls), the events A and B 
(birth of a boy or a girl) are mutually exclusive. The sum event AUB is called 
the “birth of a boy or girl’. Therefore, h(AU B) = h(A) + h(B) = H+ 38 = 1. 
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9.3 The probability concept 


We have used the word probability so often that it is time to specify its 
definition explicitly. Let us begin with a definition that goes back to Laplace. 


Classical definition of mathematical probability: 
The probability P(A) of an event A in a random experiment is given by 
number of events with result A 
P(A) = 


number of equally possible cases’ 


Therefore, we need a number of equally possible cases. Let us examine the 
examples we have seen so far as to whether we have equally possible cases. 
e The tossed coin provides only two cases, and both are equally possible 


with sufficient accuracy. Thus, the probability that a coin will land on the 
side A is P(A) = 1/2. 


e In the case of dice, the six possible events ideally occur equally frequently. 
Thus, the probability of rolling a 4 is as high as rolling a 1, namely 1/6. 


The rolling of regular 12-sided solids (dodecahedra, Fig. p. 437) has 12 
equivocal results with the probabilities 1/12. 


In a card game (with n cards), the probability of drawing a certain card 
from a well-shuffled deck equals 1/n for all cards. The same applies to the 
random removal of n balls from an urn. 


e We get in trouble with the Laplace definition when tossing a drawing pin, 
because the two probabilities are different. 


e Also, human birth is, strictly speaking, not aLaplace experiment, because 
marginally more boys are born than girls. 


Probability and relative frequency 

So, we see that we need to extend the original definition. Our starting point 
is the following observation: In most random experiments, the occurrence of 
events is in the long run subject to certain laws. These laws must, of course, 
be proven by experiments. 


Definition of probability via relative frequency: 

If the relative frequency of an event A converges to a fixed value as the number 
of experiments increases, this value is called the probability P(A) of the event 
A for the respective random experiments. 
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The classical concept of probability fits seamlessly into the generalized defi- 
nition. In this case, one assumes the equality of the events. 

>>> Application: /t will soon settle again. 

One is tempted to say: The relative frequency “oscillates”. This concept, ho- 
wever, gives the impression that there is a certain “restoring force”, but there 
is no such thing (see p. 476). We only have the following statement: 


A “temporary excessively high frequency” is only “broken down” by the fact 
that the number of events in the future will increment the total number of 
events (the numerator) at a much slower rate than the number of trials (the 
denominator). 


ddd 


One should always be aware of this fact. Random experiments do not re- 
member what happened in the past. 

>>> Application: tossing a drawing-pin 

In order to ascertain if a certain relative frequency/probability occurs when 
tossing a drawing-pin, it should be thrown 50 times, then, for example, ano- 
ther 50 times, etc. One can also shorten the experiment by tossing 50 drawing- 
pins on a table. However, the drawing-pins can mutually influence each other, 
as can be seen on the right in Fig. 9.14. 

The relative frequencies of the two possible events will only change in the 
second decimal place, and later only in the third decimal place. Even more 
accurate results do not make sense in this example. <<< 


It can happen that empirically determined probabilities change with the pas- 
sage of time. For example, with changing living habits (women being much 
older for their first pregnancy), the distribution of the sexes in births was 
found to change at least from the third decimal place onward in favor of the 
male sex. 

The probability introduced here is, thus, the theoretical counterpart to the 
empirical nature of relative frequency. Therefore, it is not difficult for us to 
accept certain assumptions based on relative frequencies when these assump- 
tions are not further justified. This so-called system of axioms was laid down 
by A.N. Kolmogoroff (1933). 

From these axioms one can now draw conclusions. Consider, for example, 
two opposite (complementary) events A and A: 

If A occurs, then A does not occur, and vice versa. Thus, AU A is the definite 
event and we have: 


P(A) + P(A) = P(AUA) = P(S)=1. 


This leads to the following formula, which is often used in practice for com- 
plementary probability. 
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Complementary probability: P(A) = 1- P(A). 


>>> Application: rolling two dice 

What is the probability of reaching a maximum of 11 total points with two 
dice (event A)? 

Solution: 

This can be solved either by writing down all the cases in which the sum of 
eyes is less than or equal to 11 or by looking at the complementary event 
A: The sum of the eyes is greater than 11, i.e. equal to 12. This can only 
be achieved with two 6s. Among all the 36 equally probable combinations (6 
possibilities for the first die and 6 possibilities for the second die), there is 
only one in which A is true. Thus, P(A) = 1/36 and P(A) = 1-P(A) = 35/36. 


ddd 


Adding and multiplying probabilities 

We generalize and define the sum of m events A; UA gU...UA as the event 
that occurs when at least one of the events mentioned occurs. Similarly, we 
define the product of m events as Ayn A2g...N Am. For this product, all events 
must occur simultaneously. 

According to Kolmogoroff, the following arises by induction for mutually 
exclusive events A,, Ag, ..., Am: 


P(A, U Agu...U Am) = P(A,) + P(Ao) +...+ P(Am). 


Fig. 9.16 Two dice show the same face Fig. 9.17 Dice was played in the 17th 
in this throw. Apart from this, the throw century and was very popular. However, 
is invalid because the red die does not lie if you think about the gambling boom of 
flat on the board. recent times ... 


>>> Application: three different results 

How likely is it to get three different results while throwing three dice at the 
same time? 

Solution: 

If the first die shows 1, then the second die will only have five options, and 
the third will have only four, so that no numeric repetition occurs. This 
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yields 5-4 = 20 possibilities. There is an equal possibility to get a 2 on the 
first die, etc. This yields 6-20 = 120 possibilities, and the probability of the 
sum event is 120- 1/216 = 5/9, which is about 56% according to the addition 
theorem. aa 


>>> Application: two questions from the beginning of probability 

Galileo Galilei was able to answer the following question, which had been 
discussed for centuries and was asked by the Prince of Tuscany: Why is the 
sum of eyes on three dice more likely to be 10 than 9 even though both 
sums can occur in six ways? Blaise Pascal solved for the Chevalier de Méré 
(a philosopher and literary man at the court of Ludwig XIV) the following 
problem: Is it more probable to roll a 6 with one die that is rolled 4 times or 
to obtain two 6s with two dice after 24 rolls? 


Solution: 

First, the task of Galilei: 

In total, we have 6° = 216 equally probable cases. The sums 9 or 10 can both 
be achieved in 6 ways: 


9=14+24+6=143452=1444+4=2424+5=2434+4=343+3 
10=14+34+6=14+44+5=24+24+6=24+44+4=24+345=34+3+4 


Triples with three different numbers of eyes, e.g. (1, 2,6), can occur in 3-2-1 = 
6 combinations. Triples with two equal numbers, e.g. (1,4,4), can only occur 
in three ways. The triple (3,3,3) can occur only once. Therefore, there are 
only 25 possibilities for the sum 9, but 27 possibilities for the sum 10. So, 
the two sums occur with the probabilities 25/216 » 0.116 and 27/216 = 0.125 
respectively. On average, the sums 9 or 10 appear with almost every fourth 
roll. 

If you consider how many attempts you have to make to show the slight 
difference in probabilities, you can only wonder why the fact was so well- 
known. In the next chapter, we will determine the significance with which 
we can test the hypothesis. 

Let us now turn to the question of Chevalier de Méré: 

Four rolls with a single die provide 6* equally likely cases. The counter-event 
of “rolling at least a 6” is called “rolling no 6”. For this, there are 54 possibilities 
and, thus, the probability is (5/6)* » 0.482. The desired complementary 
probability is 1 — 0.482 = 0.518. 

A roll with two dice has 36 equally likely occurrences, including (with the 
probability of 1/36) the double 6. The opposite of “at least one double 6” is 
the event “no double 6” (probability 35/36). If this is to occur 24 times in 
succession, the probability is (35/36)?4 = 0.509. The desired complementary 
probability is 1 - 0.509 = 0.491. So, this event is marginally less improbable. 
An additional question of de Méré is also worth mentioning: If the probability 
of rolling a double 6 with two dice is slightly less than 50%, then for how 
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many throws does the probability exceed 50%? Let n be this number. The 
complementary probability must now be less than 0.5: (35/36)” < 0.5. We 
raise the inequality to the power of —1, which reorients the inequality sign. 


Thus, we get 
—]| >2. 
35 
This can be solved by tests or using the logarithm: 


36 
n- log 5. >log2 > n> 24.6. 


This means that it is favorable to bet on the double 6 from 25 rolls onward. 
According to another question, which is completely different from de Méré’s, 
two players A and B agree that the first person who scores five points in the 
coin toss game will win. After seven throws, the game ends, with A having 
scored four times and B having scored three times. How should they split 
the winnings? 

Pascal solved the question as follows: A wins with the probability of 0.5 
on the next toss. So, A already has half of the money. If A loses the next 
roll, B has four points, and then both have the same chance of winning the 
remaining prize money. Consequently, the prize money is divided in the ratio 
3:1 for A. <<< 


>>> Application: sweepstake winnings 
What is the probability of a sweepstake-12 or sweepstake-11? What is the 
probability of having no more than one correct tip? 


Solution: 

In the case of a soccer match, the first team can win (tip 1), the second team 
can win (tip 2), or the game ends in a draw (tip X). You have to guess the 
correct outcome for 12 games in succession, in order to have a 12. For each 
game, the probability is 1/3. According to the product rule, the probability 
of a 12 equals 


ie | a ee | 
3 3 °° 3 312 500,000° 
OO 

12 times 


For an 11, one tip must be wrong (probability 2/3), and the remaining tips 
must be correct. For example, the probability of the first tip being wrong 
and the following tips all being correct is 


9 1.1 a 
3.33 °° 3 3!” 250,000° 
—— 
11 times 


However, the second or the third tip could be wrong. This results in the 
12-fold value, i.e. about 1/20,000, for guessing the outcome of 11 matches. 
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It is also quite difficult to get all tips wrong. The probability of this is, in 
fact, 


2 2 2 (2) 1 
33 °° 3° \3/ ~~ 130° 
—— 

12 times 


The probability of having exactly one right tip is calculated, in analogy to 
our calculations for 11 tips, as 


122 2 ah 4 
19:8 6 Ss Sg Sa x, 
3 3 3 3 3 22 
ee 
11 times 


In sum, only every nineteenth randomly completed sweepstakes coupon will 
have a maximum of one correct result. 


® Remark: The result of the clash of two soccer teams is, however, not a random 
event. If you know the previous matches of both teams, you can increase your 
chances of a 12. ® <<< 


>>> Application: two are better than one 

Two different early detection devices detect a tumor with a probability of 
70% or 80%. What is the probability that the tumor will be detected when 
both devices are used simultaneously? 

We first answer the opposite event: The tumor is not detected by either the 
first or the second device. The event A in which the first device fails to detect 
the tumor has a probability of P(A) = 0.3. The event B in which the tumor 
is not detected by the second device has a probability of P(B) = 0.2. Thus, 
P(An B) =0.2-0.3 = 0.06. There is a probability of 6% that the tumor will 
not be detected. Or put differently, there is a 94% probability of the tumor 
being detected by at least one of the two devices. 

® Remark: In practice, it may well be the case that seemingly independent devices 


are based on the same principle. In this case, the calculation is no longer correct. ® 


ddd 
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9.4 Conditional and independent events 


So far, we have often calculated the probabilities of independent events: New 
game, new chance, as is often said in casino jargon. When dice are rolled, 
the same probability is obtained, no matter how many times we have already 
rolled. Let us now examine the probability of dependent events. 


>>> Application: consider the flip-side 

On the shelf of a fruit stand (Fig. 9.18), there are 10 fruits 3 of which are 
rotten at the back. We pick a fruit and put it into the shopping basket because 
we assume the fruit to be fresh. What is the probability that the next fruit 
will also be good? 


en 
Fig. 9.18 Left: The available fruits that are wonderful. Right: As beautiful as they 
looked in the front: one of the two apples is rotten! 


The experiment consists of drawing one fruit out of ten, and all ten cases 
have the same probability (i.e. are equally likely to occur). 

Let A be the event that the first fruit is not rotten. Its probability is 7/10 = 
0.7. If this event happens, there are only nine fruits remaining on the shelf. 
Six of these are not rotten at the back. 

Let B be the event that the second fruit is not rotten when the first fruit 
has been returned. Its probability is the same as A (if the position of the 
returned fruit is not memorized). We are only interested in the probability 
of B under the condition that A has already happened (symbolically BA). 
This is 


6 
P(BIA) = = = 5 * 0.67. aa 


wl re 


Bayes’s formula 


We shall now examine further dependent events in a more general way and 
prove a formula which is frequently applicable in practice. It is attributed to 
the 18th-century English mathematician Thomas Bayes: 


460 Chapter 9: Statistics and probability calculus 


Conditional probability — Bayes's theorem: 
The probability of the event B under the condition that an event A has occurred 
equals 


P(BIJA) = P(AnB)/P(A). (9.1) 


® Remark: We also speak of the conditional probability of the event B under the 
hypothesis of A. ® 


Proof: Let us consider an experiment in which a total of n equally likely cases can 
be distinguished. Furthermore, two events A and B are to occur. In m cases, the 
occurrence is An B (both A and B). In k cases, the occurrence is A (and not B). 
In / cases, the occurrence is B (and not A). On the basis of the classical probability 
definition, we have the probabilities 

k l m 

P(A)=—-, P(B)=—-, and P(AnB)=-—. 

n n n 
Let us now determine the probability of B under the additional condition that only 
the cases in which A occurs (symbolically P(B|A)) are considered. 
This new probability is obviously 


m 
P(BIA) 7 ‘ 
as this time we only consider k cases where B occurs m times. The expression makes 
sense only if we exclude k = 0. So, we have 


m 


k; 


_ P(AnB) 
P(A) 


3l|3 [3 


By interchanging the labels, it is immediately obvious that 


P(AnB) 


P(AIB) = Fry 


This yields the following theorem, which is important in practice: 


Multiplication theorem: 
If two events A and B have the probabilities P(A) and P(B) in an experiment, 
then the probability that A and B occur simultaneously is 


P(AnB) = P(A)- P(BJA) = P(B)- P(AIB). (9.2) 


>>> Application: two at one blow 
Let us return to our previous example of the 10 fruits at the fruit stand where 
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three pieces are rotten at the back. What is the probability of picking two 
fresh fruits when exactly two pieces are taken from the fruit stand (Fig. 9.18, 
right)? 

Let A and B again be the events in which the first fruit or the second fruit 
is at rotten. Obviously, P(A) = P(B) = i. Now, we have seen: P(B|A) = 


97 3° 
The probability that both events occur simultaneously is given by the mul- 
tiplication theorem 


P(An B) = P(A) P(BIA) = 7 2 60.47. 


wile 


>>> Application: checking a random sample 

There are 4 smugglers among 9 people. What is the probability of getting 
three smugglers when inspecting a group of three persons? 

The probability of catching a smuggler when picking the first person is 4/9. 
Now, there are still 8 people among which there are 3 smugglers. The proba- 
bility of the second person being a smuggler is 3/8. Similarly, the probability 
that the third person is another smuggler is 2/7. According to the multipli- 
cation theorem, we have a probability of 


432 1 


9°87 OI 
that everyone is a smuggler in a group of three. This is a chance of less than 
5%. An event with a probability of less than 5% is considered “unlikely” in 
statistics. <<< 


>>> Application: the birthday problem (Fig. 9.19) 

What is the probability that the birthday of each member in a group of 5 or 
7 people will fall on a different day of the week? 

The first person may still have his/her birthday on any day of the week. The 
second has, then, only 6 days “to select from”. Thus, there is a probability of 
6/7 for him/her to be born on another day, etc. In the case of 5 people, we 
have a probability of 


For seven people, the probability is reduced to less than 1% by the factors 2/7 
and 1/7, which is already extremely unlikely. With more than seven people, 
it is impossible to have only one person for each day of the week. 

In the same way, only with more computational effort, one shows that 23 
people in one room are enough to have more than a 50% chance that at least 
two people in the room have their birthday on the same day of the year. 
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Fig. 9.19 This combination (7 persons whose birthdays fall on 7 different weekdays) 
is a very lucky event and occurs with a probability of less than 1%. 


The probability that 22 people were born on different days of the year equals 


365 364 365-21 © 
365 365 365 


0.5243, 


which is just barely over 50%. For 23 people, it falls below the 50% mark: 


365 364 365 — 22 


. ha? = 0.4927. 
365 365 365 


With twice as many people, the probability of the event is 


365 364 365 — 45 


. ak = 0.05175. 
365 365 365 


So, in this case, the event is very probable (about 95%). With 70 people, we 
have 


365 364 365 — 69 
365 365 365 
and the event is nearly certain to occur (99.9%). 
We do not want to solve the problem by trial-and-error, but by a formal 
equation. First, according to Formula (8.5), which provides the approxima- 


= 0.00084 » 0.001 


tion 1+ ta x e', we have 
n-1 : n-1 : n-1 
365 — 1% i 
[| =[[Q-— ser ae, 
<7 ©6365 LI( 365) “ 


which is good for fairly large n. We can reshape using the rules for calculation 
with powers, and get 


H =e Ur 1365 _, In A = Mn 365, 
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Now, let H = 0.5. Then, 
n(n-1) 
730 


In0.5 = —- 


which is equivalent to 
n” —n- 505.997 = 0. 


This quadratic equation has the solutions 


n= 0.5 + V0.25 + 505.997. 


The positive solution is n ¥ 23. <<< 


>>> Application: lottery win 

What is the chance of winning the lottery “6 from 49”? 

For the lottery “6 from 49”, there are 49 balls numbered from 1 to 49. Six balls 
are arbitrarily chosen (and the numbers are listed in order of the magnitude 
for the sake of clarity). What is the probability of guessing all six numbers? 
Suppose that we immediately guess one of the six numbers. The probability 
of this event is 6/49. Now, we have to guess the second number, which is 
already a bit harder, because only 48 numbers are available and only 5 of 
them are suitable for us (probability 5/48), etc. So, guessing the sixth number 
has a very small probability of 1/44. This gives a product probability of 


6s A Bo f.. 1 
49 48 47 46 45 44 14 million 
for winning the lottery. 


® Remark: The Austrian variant of this lottery game uses 45 balls (“6 from 45”). 
For this variant of the game, the probability is about 1:8 million. So, there is no 
“quantum leap”, and you are “just” twice as likely to guess the right combination. 
This tiny benefit will barely help. © daa 


>>» Application: spread of an infection (Fig. 9.20) 

Three juxtaposed plants run a 50% risk of being infected with mites. They 
infect their neighboring plants with a probability of 20%. What is the pro- 
bability of the middle plant becoming infected? 

In any case, the middle plant becomes infected with a probability of 0.5. If 
the middle plant is not diseased, the probability that the left plant becomes 
infected, but not the right plant, is 1/8 (one of eight equally possible cases). 
However, since the left plant then infects the middle one with a probability 
of 1/5, the probability of the middle plant becoming infected increases by 
1/8-1/5 = 1/40. The same applies to the case in which only the right plant 
is infected. Now, there still remains the case of both the left and right plant 
being diseased (each with a probability of 1/8). In this case, the probability 
that the middle plant will not become infected is (1-1/5)? = (4/5)? = 16/25, 
That is, the middle plant will become infected with a probability of 1-16/25 = 
9/25. In total, we have a probability of 1/2 + 2- 1/40 + 9/200 = 0.595 that the 
middle plant will be infected with mites. <<< 
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Fig. 9.20 Monocultures are particularly vulnerable. 


Reversing conclusions 


As we have seen, the calculation of P(event|cause) is quite frequent. However, 
it is not infrequent for P(causelevent) to be sought, i.e. for the argument to 
be reversed. 


>>> Application: testing twice before drawing conclusions ... 

Some diseases occur very rarely. Physicians use the term prevalence to de- 
scribe the chance of a person carrying a disease. Let now 1 : 5,000 be the 
prevalence for a particular disease. That is, on average, every five thousandth 
person carries the disease. Assume further that there is also a screening test 
for this disease which detects with a 99% reliability whether the illness is 
present or not. What is the probability that the test sounds the alarm even 
though the disease is actually not present? 

We solve the example “with common sense”. Every stick has two ends: If the 
test works with 99% reliability, it fails with a 1% probability. This means 
that it will sound the alarm for 1% of the 4999 non-sick persons, that. is, 
at least 50 times. The patient is almost certainly identified. This makes 51 
alarms. 

If we now test the same 51 people again, the screening test looks much better: 
For the 50 healthy people, the test will give an alarm 0.5 times on average. 
The patient is discovered during the second round. It may still happen that 
a healthy person is mistakenly declared ill twice in a row. So, we need a third 
test! 

If the prevalence is higher, e.g. 1: 100 (a typical rate for HIV-infected people 
in many countries), there is an average of only one false alarm during the 
first test, and the second test is already quite reliable. 

With a low prevalence, even a 99.9% accuracy is of little use. In the above 
case (1: 5,000), we still have five false alarms and one correct alarm during 
the first test. This time, however, the second test is sufficient, because it 
causes the false alarms to drop to 0.005. << 
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Let us solve the above example by means of a formula. The process is also 
known under the name of backward induction. According to Bayes’s theorem, 
we have 


P(A): P(BIA) = P(B)- P(A|B). (9.3) 
Let us now eliminate P(B) from the formula. To do this, we first write 
PUBY =e P(By-1 
and replace 1 by the probability of the definite event A|BuU A|B 


P(A|B) + P(A|B) = 1. 


So, with 
P(B) = P(B)-(P(A|B) + P(A|B)) = 
= P(B): P(A|B) + P(B)- P(A|B), 
P(BIA) P(B\A)-P(A) 
we find 
P(B) = P(BJA) + P(B\A)- P(A). (9.4) 


(9.3) and (9.4) together yield a new formula: 


ye (9.5) 
P(BJA) - P(A) + P(BJA)- P(A) 


>>> Application: the force of habit 

Every eighth American drinks tomato juice for breakfast (in the rest of the 
world only every 80th person). Let us assume that we attend a conference 
where 50% of the participants come from the USA and one participant is 
spotted drinking tomato juice at breakfast. What is the probability that 
he/she is an American? 

Solution: 

According to Bayes’s formula, 


P(AnT) —-'1/16——_ 10 
P(T) = 1/16+1/160 11 


P(AT) = (91%). 
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>>> Application: early detection of rare diseases 

Let us apply Formula (9.5) to our problem dealing with disease prevalence 
and early recognition: 

Let A be the event that the person being tested carries the disease and B be 
the event that the test sounds the alarm. 

Now, we have the following events: 


e A|B: The person is actually sick when the test alerts. 
e BIA: The test responds correctly if the person is sick. 
e B\A: The test mistakenly identifies a healthy person as diseased. 


In the special case, P(A) = 1/5,000 = 0.0002, and therefore, P(A) = 0.9998. 
Furthermore, “according to factory specification”, P(B|A) = 0.99 and P(BIA) = 
0.01. 

Thus, according to the above formula, we get 


0.99 - 0.0002 _ 9.0002 1 
010.99 +0.99-0.0002 0.01 50° 
Now we could debate which of the two methods — the “common sense method” 
or the “formula method” — is better. When applying the formula, you must 
know exactly what you are doing. Once you are sure of your subject, you 


P(AIB) == 


can of course vary more quickly. Let us put it in this way: The computer 
often has the formula, but the danger lies in the person entering the data 
erroneously! <<< 
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9.5 Combinatorics 


Examples such as those of the lottery-6 or the sweepstake-12 show that pro- 
bability calculations often involve having to count many (favorable or also 
possible) cases. It is often difficult to avoid making mistakes. Are there simple 
formulas for counting the number of possibilities? 


Three typical questions 


1. How many combinations are there for k offices to be occupied by k people 
according to the rotation principle? 

The first office can be occupied by k different people. The next office can 
only be occupied by k - 1 people, etc. Overall, there are k-(kK-1)-...-2-1 
combinations. This number already appeared in Chapter 3, and was denoted 
by kl, ie. & factorial: 


kl=k-(k-1)+...:2-1. 


>>> Application: rotation principle (Fig. 9.21) 
Three people share an apartment and want to share the responsibility for 
the three most unpleasant “chores” (vacuum-cleaning, washing dishes, and 
cleaning the toilet) based on a weekly changing rotation principle. How many 
weeks will it take for the cycle to start again? 


Fig. 9.21 rotation principle 


If one of the three flatmates is in charge of the vacuum cleaner, two people 
remain and one of the two must wash the dishes. The final remaining person 
must then automatically do the third chore of cleaning the toilet. Therefore, 
there are 3-2-1 = 6 combinations, and the cycle restarts every six weeks. 

ddd 
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2. In how many combinations can n people occupy & different posts? 

The first post can be taken by each of the n people. For the second post, 
there are only n-1 people available. For the k-th post, only n-k+1 people 
are free. All in all, we have 


n:(n-1)-(n-2)-...:(n-k+1) 


possibilities. With our new notation, this product is easier to write. We ex- 
pand the numerator and the denominator by the factor 


(n-k)-(n-k-1)-...-2-1=(n-k)! 


and obtain 


nm (w= 1) + (n= k41) (RB) 21 onl 
(n—-k)-...-2+1 ~ (n-ky! 


>>> Application: rituals 

In one room, there are 7 people (4 women and 3 men) who will raise their 
glasses to each other on the New Year. What is the probability that a woman 
will raise her glass with a man? 

In analogy to the above example, the glasses will clink 7-6/2 = 21 times 
overall. The women clink their glasses together 4-3/2 = 6 times and the men 
3-2/2 = 3 times. There will be 21-6-3 = 12 mixed pairings with a probability 
of 12/21 » 0.57. <<< 


3. In how many ways can n people be grouped into equal teams of k persons? 
This question can be answered with the solution strategies used for the first 
two questions. First, we have n!/(n - k)! ways to form a team where the 
order is still important. If the order is not important, each team is counted 
k! times. This allows us to form 


n! 
k!-(n-k)! 
teams. 
This is the so-called binomial coefficient n choose k (sometimes called k from 
n) for integers n > k: 


my -%, wal n=k+1 | n! 
k k k-1 0° 1 k!-(n-k)! 


® Remark: The term binomial coefficient stems from the formula 


(a+b)*= @ abo + (7) ar tph sy (;) Ors cnt (,.” ‘ apts () ab” (9.6) 


(see also p. 265), which is easier to understand when written like this: After complete 
kpn-k = 


expansion of the product (a +b)” = (a+b)-...-(a+6), the summand a 
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a:...:a:b-...-b shows up (3) times if we take a out of k brackets and b out of 
k times n-k times 

n-—k brackets. This is precisely the statement posed. @® 

>>> Application: courtesies 

Four or, for a more general consideration, n people in a room shake hands 
to greet each other. How many handshakes occur? 

In the case of the four people, it is still clear: Let A, B, C, and D be the 
people. Then, we can enumerate six relevant pairings: AB, AC, AD, BC, 
BD, and finally CD. Thus, there are six handshakes. 

One also comes to the same result as follows: One of the four people can 
shake hands with three others. Since this is true for every person, we get 
4-3 = 12 pairings. However, we have counted each handshake twice so that 
we actually come to 6 pairings. 

Now, the calculation for n people is easy: Each of the n people can shake 
hands with n-1 others, and in the end, this number is to be halved. The 
general result is n-(n—-1)/2. For n= 4, this is again 4-3/2 =6. <<< 


>>> Application: alphabet soup 
In how many ways can you arrange (permute) the letters of the word STA- 
TISTIK (German for “statistics”)? 


Fig. 9.22 alphabet soup 


® Remark: The word consists of 9 letters, but only the letters A and K appear once. 
I and S occur twice, and T occurs three times. If we choose the position of A in 
nine ways (9 over 1), the position of K can then be selected in 8 ways (8 over 1). 
The two Js can be arranged within the remaining 7 positions in 7 choose 2 ways. 
Meanwhile, the two remaining S have only 5 possible positions left for 5 choose 2 
ways. Now, only 3 positions are free, and they necessarily have to be occupied by 
the three T (3 choose 3 possibilities, thus, exactly one). The number of possible 


words is, thus, 
s=9-8-(5)-(3) = 72-21-10 =15,120. 


We can also write the result as 
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1!-1!- 21-3) 


This result has a much easier interpretation. Indeed, it suggests a general formula 
in which the permutations appear without repetition in the numerator, while the 
denominator contains the products of the permutations of the individual repetitions. 


@ << 


Binomial coefficients have remarkable properties 
From the definition, we immediately conclude 


(!)= (04) 


This is useful when k > n/2. So, we get 


100 100\ 100 99 
(oe) ( 9 )= 22.2 -50-99= 4,950 


The binomial coefficients n choose k can be calculated recursively. First, by 


definition 
O\_ 0! gp n\ [n\n 24 
oO} o!-(0-0)! ” \n}J \O} n!-or 


and furthermore, 


n+1\ (n 4(.” Se la) 
Rai) le isi recursion formula). 


>>> Application: a rule for the coefficients 

Pascal’s triangle can be used to determine all binomial coefficients up to 
the order n. According to the recursion formula, we can find the entries of 
Pascal’s triangle by simple additions. Each new number (starting from the 
third line) is the sum of the two numbers above it. 
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1 6 15 20 15 6 1 
1 7 21 35 385 21 7 1 


Fig. 9.23 All cans must be on 
the shelf. Also a kind of Pascal’s 
triangle! 


Application to “classical” gambling games 


>>> Application: lottery, the second 

On p. 463, we calculated the probability of the lottery-6. There are 6 numbers 
to be taken out of 49 or 45, whereby the order plays no role. In principle, this 
example can be handled like the third example on p. 463 with the formation 
of teams of size k out of n people. Consequently, we have 


(*] 49-48-47-46-45-44 
sS= = 


x 14 million 
6 6-5-4-3-2-1 


ways to draw the numbers. 

Now, we can expand the example further and ask: What is the probability 
of guessing a lottery-5, lottery-4, etc.? 

The 49 numbers are divided into two groups: one group of 6 correct numbers 
and a second group of 43 false numbers. In order to make an accurate guess 
of r correct numbers, r numbers must be guessed from the set of correct 
numbers and 6 -r from the set of false numbers. This can be done in 


() (c=) 


ways. For r = 4, we have, for example, 


6\ (43 6\ (43\_ 6 42 
son (2)-()(0)-(2) = 8.5-45-28 ors 


possibilities and, therefore, a probability of 
s4/s » 14,000/14, 000, 000 » 1/1, 000, 
to guess a Lottery-4. 


With r = 0, we can calculate the probability that no single number has been 
guessed correctly. We have 


6\ (43 aa: 
59 = () ( 6 x 6.1 million, 


—— 
1 


which results in a probability of so/s * 0.44. <<< 
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>>> Application: sweepstakes, the second 

On p. 457, we considered in how many ways a sweepstake-11 can be achieved. 
The result: 2-12 = 24 possibilities (in the wrongly guessed game, two wrong 
answers are possible). With the current knowledge, we would say: two times 
12 choose 1 combinations. With our formula, we have much greater flexibility. 
The number of 10s (exactly 2 wrong tips) is 2? times “12 choose 10” = 264, 
and the probability of guessing 10 games correctly is 264/ 37 3 1: 2,000. 

® Remark: For an insider — in contrast to the lottery — the probability for a 10 is 
usually greater. For some soccer games, the result is clear anyhow, because one of 
the two teams is usually more likely to win. <<< 


Combining systematically 


We have already learned five different ways to group n different elements of 
a set into a class: 


e We can arrange the elements in groups of n. The order of the elements plays 
a role. You can exclude repetitions or not. This process is called permutation. 


1. Permutations without repetition 
The number of possible ways to arrange n different elements of a set is 


P(n)=n!. 


>>> Application: three-letter words 
From three letters a, d, and i, we can build 3! = 3-2-1 = 6 different words: 
adi, aid, dai, dia, iad, and ida. <<< 


2. Permutations with repetition 
If k elements occur several times in the set, then the number is reduced to 


n! 


PW (n) = : 
>>> Application: rolling four dice 

Rolling four dice results in 1, 1, 2, and 2, giving a total of 6. You can roll the 
sum 6 in 4!/(2!2!) = 6 ways with exactly the same number of eyes. <<< 


e Furthermore, we can combine elements of the basic set to classes with k <n 
elements, with the order of the elements either being important or not. We 
then speak of variations. 


3. Variations without repetition 
Consider the following problem: You should select & elements from a set with 
n different elements so that no element occurs more than once and the order 
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plays a role. This can be done in 


n!\ 


(n hl 


Vm,k) ane (nT). (et k4 1)= 


ways. 
>>> Application: two-digit numbers (1) 

How many two-digit numbers can be formed from 1, 2, and 3 if each digit 
can only occur once? The solution is: {12, 13, 21, 23, 31,32} which is 


3! 


VG?) aon 


3:2=6 
combinations. << 


4. Variations with repetition 

Consider the following problem: Again k elements from n elements are to 
be selected. The order is important, but the elements may occur more than 
once. This can be done in 


Ve(n,k) =n-n-...-n=n* 
—— 
k times 
ways. 
>>> Application: two-digit numbers (2) 


There are 3? = 9 two-digit numbers that can built from 1, 2, and 3, namely 
11, 12, 13, 21, 22, 23, 31, 32, and 33. Phe 


e If the sequence matters, one speaks of combinations. We have already dis- 
cussed the case when elements in the selected class cannot be repeated. 


5. Combinations without repetition 


>>> Application: mixing colors 

How many (RGB) mixtures can be composed from Red, Green, and Blue in 
the ratio 1:1? The mixed colors are RG, RB, and GB, which is K(3,2) =3 
possibilities. <<< 


6. Combinations with repetition 
A sixth and final case is still pending: The elements in these classes may be 
repeated. 


>>> Application: bouquets 

How many combinations can be picked from yellow, white, and violet flowers 
as a gift to your sweetheart (Fig. 9.24)? 

Solution: 

First, we find all possible combinations by simple enumeration: ggg, ggw, 
guu, WwuU, WWU, WVU, VU, Vg, Vgg, and vgw. So, there are 10 combinations 
of three flowers that differ in color. 
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Fig. 9.24 Left: A sufficiently large basic amount of three different flower varieties 
is given. The binding of a small bouquet can be interpreted as a combination with 
repetition. Right: The field violet is the “joker” in our concept and can “transform” 
into each of the three flower varieties (white, yellow, violet). Formula 9.8 can thus 
be explained. 


If there are enough flowers available, we can also bind much larger bouquets 
(kK is arbitrary). In other words, k may be greater than n. With n = 3 (3 


varieties), we assume that there are enough representatives of each variety. 
ddd 


The formulas for combinations with repetition can also be easily understood: 
Let us imagine n different elements and k-1 different “jokers” in the basic set 
(see Fig. 9.24, right). From this set, there can be n+k-1 over k combinations 
without repeating the k elements in each. The jokers can subsequently “turn” 
into any of the n elements, thereby capturing all possible repetitions. More 
specifically, there are also n cases in which one element is repeated k times. 
We have already found the number of combinations with repetition: 


a) (0.8) 


K® (n,k) = ( ; 


We check this formula by means of the above example with the flower bou- 
quet. We need one flower of each kind and in addition two field violets as 
jokers which yields a total of five flowers and allows the combinations ggg, 
ggw, guw, www, ww, wv, vvvu, vug, vgg, and vgw. This results in 


9 = — =10. 


= aa 
KR(3,3) = te : e 
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9.6 Fallacies, traps of reasoning, and apparent 
contradictions 


In the previous sections, we have already worked out a lot of the calculations 
of “classical” probabilities. Routine tasks can be solved without difficulty. 
Nevertheless, one should generally be aware of the possibility that there may 
be some ideas or supposed contradictions (paradoxes) that must be avoided. 


® Remark: Fallacies usually occur when a tried and tested mechanism of thought 
leads to erroneous results. The only way out is to be prepared for it — just as one, 
for example, can escape optical illusions by applying a ruler. Thus, for instance, 
the question of whether London is further North or farther South than Berlin is 
answered as “further North” by the majority because England “as a whole” lies 
further North than Germany. ® 


>>> Application: the false queue 
Why do most people feel that they are standing in the longer queue? 


AV Uh. ere j 


eee — 


Fig. 9.25 When cycling in hilly terrain, it always seems to go uphill. No wonder, 
you need twice as long for the same route as when you go downhill. 


Solution: 

Queues as a whole are usually of equal length, because every newcomer wants 
to minimize his/her own waiting time. Nevertheless, most people feel that 
they have picked the wrong queue, which is, of course, paradoxical. The 
apparent contradiction resolves itself when one considers the following: If 
you were actually in the wrong queue, you would have to wait longer. Should 
we consider a randomly chosen situation among all experienced “queuing 
situations”, then the possibility of standing in the longer queue is higher. 


® Remark: The “red traffic light effect” (the feeling that one constantly encounters 
a red traffic light) is comparable. There is also a similar effect in cycling. If the 
terrain is hilly, it almost always seems to go uphill. This also has to do with the 
choice of the time: Going downhill is much faster than going up. That is, it takes 
more time to cycle the same trajectory uphill. @ <<< 
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The gambler’s fallacy 


A classic illusion which has ruined many a man is known as the gambler’s 
fallacy. 

The fallacy occurs when someone believes that something “which has to occur 
in the long run” will occur the next time. One could also put it this way: 
An event that was expected to happen has not occurred for a long time. 
Therefore, it is assumed to happen soon. 

We distinguish two cases: the first concerns events which are independent of 
each other. 


>>> Application: sometimes 6 comes 

In a game of dice, there has not been a six 10 times in a row. So, with every 
new throw of the dice, it is believed that the 6 is more likely to appear. 
Likewise for the coin toss: “If heads has appeared 10 times in a row, then the 
probability of tails has to increase, because on average, tails appears every 
other time and the long series of heads has to be compensated...” 
However, neither dice nor coins have a “memory”, and what was thrown before 
does not affect what the next roll will be. <<< 


The second fallacy relates to events that depend on each other (or on a third 
event). 


>>> Application: statistically speaking ... 

The soccer team of a certain nation has always participated in the four-year 
World Championship and has been in the finals every third time for the 
last 36 years. The last two times, the team was not in the finals. So, the 
probability of entering the finals this time is believed to be extremely high. 
The team, however, can be in a bad shape and the series of victories can end. 
There is probably no mathematical law behind this. <<< 


The Saint Petersburg paradox 


Even if you are well aware that a coin cannot remember whether it previously 
landed on heads or tails, there is still hope that the following — paradoxical 
— strategy might work: 

>>> Application: Somehow the money runs out ... 

It is understood that for a coin toss, a double bet is paid out if one guesses 
the right side. Now, one plays with this system and keeps insisting that the 
coin will land on tails. If tails does not appear, you double your bet and bet 
on tails again. After the k-th toss, if it finally lands on tails, you will always 
have a profit. To see this, we only need a small calculation: Let EF’ be the base 
bet. If tails shows up, 2- F is paid out giving a net-profit of E. Otherwise, 
2-F is bet again. If tails appears, 4-F is paid out with an investment of 3-E. 
So, you have again a net profit of EF. If one has already doubled k- 1 times, 
the investment is 


(0 ko Ba" 1). Wao". we. 
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If tails shows up, 2". E gets paid out and your profit is again LE. 

If you play the game all night long, again and again, it adds up to a lot. It 
may be seen as a form of “work”, for which one gets money between his/her 
wages. You cannot lose in any case! 

Of course, you can lose, at the latest once the stake has grown so high that 
you do not have enough money to pay it. If you bet F = 1 € per game, then, 
after only 20 tosses showing heads, the stake is at least already 279-1 > 
1,000,000 €. It is no comfort that the probability that tails first appears in 
the 21st toss is only 1/279 » 1: 1,000, 000. <a< 


>>> Application: infinite bets 

The above example has become famous in a modified form through Daniel 
Bernoulli’s idea from 1738. The following gamble is offered in a hypothetical 
casino in Saint Petersburg: 

A participation fee is required. Then a coin is tossed until tails falls. This 
completes the game. The profit depends on the total number of coin tosses. If 
there was only one toss, the player would receive one ducat. For two throws, 
the player would receive two ducats. For three throws, the prize is four ducats, 
and so on. So, you get 2*-! ducats when the coin is thrown k times. 

We calculate the expected value for the profit by including the associated 
probabilities for paying out the respective profit sum: 


Sok if pT 7 


G= eA 
a a a 


bt... = OO. 


Theoretically, the expected value is oo, and the bank is not able to set the 
bet high enough. Otherwise, it would be bankrupt sooner or later! One way 
to escape the dilemma is, of course, to set a maximum number of tosses m. 
Then, the bank would have to set a basic bet of at least m/2 ducats per 
player. 

This example has become famous mainly because it has sparked a series of 
proposals for its solution, though only Bernoulli’s solution shall briefly be 
mentioned here. He wrote: “The calculation of the value of an object need 
not be based on its price, but rather on the usefulness that it possesses 
.... Undoubtedly, a profit of 1,000 ducats is more significant for a beggar 
than for a wealthy man, although both receive the same amount.” Therefore, 
Bernoulli suggested to distribute only In2" ducats instead of 2k ducats. As 
a result, the expected value for the profit is no longer infinite 


= Sin(2h') & (k-1)In2 
7 ‘e Ok 7 » dk 
k=1 k=1 


ea (K-11) 
al, TE = In2(1/2? + 2/23 + 3/244...) < 00. 


=1 (without proof) 
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The 50:50 error 


>>> Application: as if: “no divulging of secrets” 

Suppose that three candidates A, B, and C have applied for a post. A is 
waiting alone behind the door where the committee is making its decision 
and thinks: “There is a 1/3 probability that I will be accepted.” When a single 
member of the jury leaves the room, A asks him or her: 

“IT know you cannot tell me whether I have been accepted or not. But it is 
clear at least that either B or C was not accepted. Tell me which of the two 
was not accepted. Then, you would not tell me a secret, and I would have no 
additional information about my situation. I promise to keep the information 
to myself only.” 

The jury member is convinced and says “B was not accepted”. Thereupon, 
A beams and says: “Now the probability that I will be accepted has risen to 
1/2.” Is that correct? 

The answer is not trivial: A is not quite right (his probability of 1/3 has not 
changed), but the probability that C gets the position has actually doubled! 
To understand this, consider the three possible decisions of the jury (+ or 
— in the first position indicates whether A gets the position or not, etc.): 


1 1 1 
T 5 T 5 Ts: 


All three are equally likely. Judging by the committee’s verdict, case 2, which 


would have been one of the two unfavourable decisions for both A and C, is 
no longer valid. 

However, the difference between A and C is now that the question posed by 
A did not refer to A, but to B and C. Therefore, nothing changes for A. The 
probability of C not being accepted is halved. 

To understand this better, consider the notorious “three-door problem”. «<< 


>>> Application: three doors and two strategies 

Just imagine the following situation in a game show, just before the goal: 
One must “only” point to one of three doors hiding the main prize (a sports 
car). However, behind two of the doors, there are merely consolation prizes. 
To make things even more annoying, if you point to the wrong door, the 
consolation prizes hiding behind these doors are goats (blanks). That is how 
it worked in Monty Hall’s show “Let’s Make a Deal”. The host chimed in 
an interesting addendum: Once the candidate picked a door, the host, who 
knew exactly where the car was hidden, would make the candidate insecure 
by opening one of the doors hiding a goat. Then, he would offer the candidate 
(sighing with relief as there is now a 50: 50 chance) the possibility to change 
his/her initial choice. What would you do in such a situation? 

Solution: 

If you have the choice between a goat and a Porsche, you will probably invest 
some thought into figuring out a solution to this seemingly simple problem. 
Hundreds of people have expressed their curiosity about this problem on the 
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Internet and elsewhere. The answer is actually quite simple if one proceeds 
according to the following reasoning: 


I arbitrarily choose one of the three doors. If I happen to pick the door hiding 
the sports car, this is good. If I pick a door hiding one of the two goats, this 
is bad. The probability of winning the car is 1/3. If I stick with my choice, 
then this probability will not change. Not even if the host shows one of the 
two goats to me: He will do so in any case — regardless of whether I have 
picked the car or one of the two goats. 


=a 


a = 


K 


‘ 


Fig. 9.26 Car or goat? The question has provoked much discussion! 


Let us now consider the strategy of changing the initial choice “in any event”. 
There are three cases: If my initial pick is the sports car, this is bad for me. 
If I pick goat 1 or goat 2, I am lucky now: In both cases, I will eventually 
point to the door hiding the highly anticipated car. Thus, I have doubled 
the chance to win! So, the information provided by the host was helpful and 
should not be dismissed as creating only uncertainty. Strategically, picking 
the third door is the much better choice! 


® Remark: At least in the original version of the game, it was possible for the host 
to open the door initially picked by the candidate if that door was hiding a goat. 
In this case, the candidate has a 50:50 chance of picking the correct door. 

In the show “Who wants to be a Millionaire?”, the “Fifty-Fifty-Joker” randomly 
removes 2 of the 3 incorrect answers. The decision of the computer is in no way 
influenced by the candidate’s presumption. So, theoretically, the probability that 
the candidate will pick the correct answer now is exactly 50:50. ® <<< 


Computer simulations help to review considerations 


The “three-door problem” is a classic example illustrating how very smart 
people may proudly present different results. To determine who has found 
the right solution, you could spend an evening repeating the same game: A 
host hides a pea randomly under one of three thimbles (by secretly throwing 
a die, for instance). Player A taps on one of the thimbles, and the host then 
lifts a second thimble (hiding no pea). A second player B now lifts the third 
thimble. If B has guessed the correct thimble, B will keep the pea. Otherwise, 
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it is A who will, logically, get the pea. After only a few dozen experiments, 
it will become clear that B has obtained significantly more peas than A. 

If we can code well and write a computer program that works with random 
numbers and carries out all the steps in the prescribed way, we can simulate 
the situation a million times and evaluate it accordingly. The point ratio will 
almost exactly be 1:2 for a large number of repetitions. Here is the output 
of a corresponding program (n is the number of iterations; the process is 
repeated several times so we can see that the taken numbers are chosen 
randomly): 


= 10: NO CHANGE good: 30.00%, CHANGE good: 70.00% 

= 100: NO CHANGE good: 38.00%, CHANGE good: 62.00% 

= 1000: NO CHANGE good: 33.40%, CHANGE good: 66.60% 

= 10000: NO CHANGE good: 34.44%, CHANGE good: 65.55% 

= 100000: NO CHANGE good: 33.17%, CHANGE good: 66.82% 

= 1000000: NO CHANGE good: 33.28%, CHANGE good: 66.71% 
= 10000000: NO CHANGE good: 33.34%, CHANGE good: 66.65% 


= 10: NO CHANGE good: 30.00%, CHANGE good: 70.00% 

= 100: NO CHANGE good: 22.00%, CHANGE good: 78.00% 

= 1000: NO CHANGE good: 35.00%, CHANGE good: 65.00% 

= 10000: NO CHANGE good: 33.49%, CHANGE good: 66.49% 

= 100000: NO CHANGE good: 33.57%, CHANGE good: 66.42% 

= 1000000: NO CHANGE good: 33.41%, CHANGE good: 66.58% 
= 10000000: NO CHANGE good: 33.36%, CHANGE good: 66.63% 


= 10: NO CHANGE good: 40.00%, CHANGE good: 60.00% 

= 100: NO CHANGE good: 32.00%, CHANGE good: 68.00% 

= 1000: NO CHANGE good: 34.10%, CHANGE good: 65.90% 

= 10000: NO CHANGE good: 33.16%, CHANGE good: 66.84% 

= 100000: NO CHANGE good: 33.23%, CHANGE good: 66.77% 

= 1000000: NO CHANGE good: 33.31%, CHANGE good: 66.69% 
= 10000000: NO CHANGE good: 33.33%, CHANGE good: 66.67% 


= 10: NO CHANGE good: 20.00%, CHANGE good: 80.00% 

= 100: NO CHANGE good: 31.00%, CHANGE good: 69.00% 

= 1000: NO CHANGE good: 34.20%, CHANGE good: 65.80% 

= 10000: NO CHANGE good: 33.94%, CHANGE good: 66.06% 

= 100000: NO CHANGE good: 33.33%, CHANGE good: 66.67% 

= 1000000: NO CHANGE good: 33.31%, CHANGE good: 66.68% 
= 10000000: NO CHANGE good: 33.34%, CHANGE good: 66.65% 
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Aggregation traps: “half the truth” 


On a daily basis, we are overwhelmed with statistics in newspapers and ma- 
gazines that are “simplified” for the presumed sake of making them ‘more un- 
derstandable”. Yet, things are not always simple, and such statistics may — 
intentionally or not — end up presenting the wrong information. That is why it 
is often colloquially said that ‘everything can be proven with statistics”. 


Aggregation is the condensation of individual statistical statements to general 
statements. The non-specific aggregation of counted data creates connections 
which do not exist in the original statistics. The first of the following examples 
is “unproblematic”. The second is tendentious: 
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>>> Application: murders and death sentences in Florida 


skin color of the death sentence sum share of death- 
perpetrator yes/no sentences in % 
black 59/2,448 2,507 2.4% 
white 72/2,185 2,257 3.2% 


When looking at the above statistics (New York Magazine, March, 11, 1979) 
the answer seems clear: Being white seems to be a disadvantage. 

The matter looks quite different when another variable comes into play, na- 
mely the skin color of the victim: 


skin color death sentence sum share of death- 
perpetrator / victim yes/no sentences in % 
black / black 11/2,209 2,220 0.5% 
white / black 0/111 111 0.0% 
black / white 48/239 287 16.7% 
white / white 72/2,047 2,146 3.4% 


Now, we are suddenly faced with a completely new situation: Regardless of 
the victim’s skin color, blacks are more likely to be condemned to death than 
whites, especially if the victim is white. <<< 


>>> Application: division of work 

Suppose work W is evenly distributed among two people A and B. On the 
first day, A performs 60% of his daily routine, while B performs 90%. The 
next day, A performs only 10% of his work, while B performs 30%. In both 
cases, B achieves a much higher percentage — but that does not mean that 
B has done more work. 


Numerical example: A could have split his/her share in the ratio 10:1, and B in 
the ratio 1: 10. So, on the first day, A works 60/100-10/11-W/2 = 600/2, 200-W, and 
on the second day he/she works 10/100-1/11-W/2 = 10/2,200-W. B, on the other 
hand, works 90/100-1/11-W/2 = 90/2, 200-W and 30/100-10/11-W/2 = 300/2, 200-W. 
Now, the work performed by the two is 61:39 in favour of A! <<< 
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9.7 Probability distributions 


The binomial distribution 


>>> Application: vocabulary test (Fig. 9.27) 

You have learned p = 5/6 of the vocabulary assigned for a spelling test and 
are eventually asked to spell n random words from that vocabulary. What 
are the chances that you know & of them? What is the probability that you 
know at least 18 out of 20 words? 


A P(k) 


20% 


-9-9-0-9-9-0-9-0-0 
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Fig. 9.27 How many words do we know? 


Solution: 

Suppose you know the first & words, and not the remaining n — k. For this, 
the probability is p*-(1-p)"-*. You need not know the first word in order to 
spell a total of & words correctly. There are n choose & combinations. Thus, 
the general formula is 


P(k) = (;' pf (— py. 


Fig. 9.27 shows a diagram for n = 20. It shows that the probability of knowing 
exactly 18 words out of 20 is about 20%. There is a 10% probability of 
knowing 19 words, and only 3% of knowing 20. In total, the possibility of 
knowing at least 18 out of 20 is 1/3, ie. 33%. << 


>>> Application: the Galton board 

When balls fall from a funnel onto a board of nails as shown in Fig. 9.28, 
they distribute themselves in a bell-shaped manner at the lower end. This is 
a sequence of adjacent 50 : 50 decisions (p = 1 - p = 0.5). 
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Fig. 9.28 formation of a bell curve in a Galton board 


The distribution of the balls is ideally like a bell-shaped “stepped polygon”. 
Fig. 9.29 and Fig. 9.30 illustrate a Galton board for 10 and 20 rows of nails. 
Better estimates for the polygon can be made when we increase the number 
of dropped balls. 


P(r) 


20% 


5 10 20k 


Fig. 9.29 binomial coefficient n = 10 Fig. 9.30 binomial coefficient n = 20 


Sums of probabilities 


When it comes to the probability of exceeding or falling under a certain 
limit, one has to sum up individual probabilities (in Application p. 482, the 
question was about the probability of passing a vocabulary test where at 
least k out of n words are known). If, as in Fig. 9.31 and Fig. 9.32, it is not 
the probabilities but the sums of preceding probabilities that are applied, it 
is easy to determine when such a barrier is exceeded. 


>>> Application: resolute minorities 

An association of 25 members will vote on a number of topics by mail. In the 
case of a seemingly irrelevant proposition in which most members will more 
or less randomly tick yes or no, five of the members have secretly decided to 
vote no. What is the probability that the proposition will still pass based on 
a majority voting in favor? 

Solution: 

A total of 13 opposing votes is required for a rejection of which 5 are already 
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Fig. 9.31 summation probability n= 10 Fig. 9.32 summation probability n = 20 


guaranteed. Thus, P is the probability that, out of the 20 undecided members 
(p = 1-p = 0.5), a maximum of 7 will vote no. There is a (symmetric) binomial 


7 
distribution and P= ¥ -0.5* -0.520-* » 0.13. In Fig. 9.32, this value is 
k=0 
found at the point k = 7. The probability that the resolute minority rooting 
for a no will win is 87%. aaa 


>>> Application: crackling rain (Fig. 9.33, Fig. 9.34) 

Raindrops strike at random. Nevertheless, there is a locally constant amount 
of rainfall per square meter during a strong thunderstorm. That amount may, 
for example, be 1 litre (~ 6,000 drops) per minute. Thus, we have an average 
of 100 drops per second on a square meter. The impact area shown in the 
picture is 1/30 m?. We expect there to be 100/30 ~ 3 drops per second. What 
is the probability of seeing the impact of two (or three) drops simultaneously, 
as in the photographs below? 


Fig. 9.33 two simultaneous drops Fig. 9.34 three splashes 


Let us, first of all, arrive at a reasonable estimate by using common sense, 
especially since we have not made precise assumptions. 

An initial assumption is that the impact of each drop will be visible for a 
fraction of a second in the form of an “explosion cone”. If this was not the 
case, one would almost never see the desired impact at short exposures (in 
this specific case, 1/125 second). The probability that one sees the desired 
impact at one drop per second is assumed (as an empirical value) to be 1/5. 
The probability of not seeing it (complementary event) is 1-1/5 = 4/5. 
Secondly, we assume the impact of exactly three drops every second. The 
probability of not seeing any of these drops is P(0) = (4/5)? » 0.51. The 
probability of seeing exactly one impact is determined as follows: Suppose 
that we see the impact of the first drop, but not that of the other two. For this, 


9.7 Probability distributions 485 


we have the probability 1/5-4/5-4/5. The visible drop can be varied in three 
ways, so that exactly one drop can be seen with P(1) = 3-1/5-4/5-4/5 = 0.38. 
The probability that the first two drops can be observed at the moment of 
their impact but not the third equals (1/5)?- (4/5) = 4/125. Since it does not 
matter if we see drop 1 and drop 2 or drop 1 and drop 3 or drop 2 and drop 3, 
this probability is P(2) = 3-(4/125) x 1/10. On average, we see simultaneous 
impacts of two raindrops in every tenth picture. In fact, out of about 50 
taken photographs, only five of them show what may more or less qualify as 
a double impact (see also Application p. 216). 


® Remark: The probability of seeing three impacts at the same time is P(3) = 
(1/5)? = 1/125, ie. just under 1%. According to our highly simplified model, there 
would be no possibility of four or more visible impacts (P(4) = 0). It might oc- 
cur in practice but it is extremely unlikely. This already shows: We need a more 


sophisticated model for probability distributions. ® <<< 


The Poisson distribution 


In the previous application, it was, first of all, necessary to find an average 
expected value A for the number of raindrops. We set it to be 3 drops per 
second. Now consider Fig. 9.35: If someone told you that he/she had analysed 
1,000 attempts exactly and had thus come to this distribution, you would 
probably believe him/her, because the distribution looks realistic. 

Let us relate this now to Fig. 9.36: If you have determined that you can 
photograph the impact of an average of = 0.6 water droplets, then the dis- 
tribution shown also seems plausible. It agrees fairly well with the estimated 
values of 55%, instead of 51%, for zero visible impacts; 33%, instead of 38%, 
for one visible impact; 10% for two visible impacts; and 2%, instead of 1%, 
for three visible impacts. 
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Fig. 9.35 raindrops per second 


P(r) & 
55% 


Fig. 9.36 drops captured on the photo 


Why do we compare the values so extensively? Well, the two “plausible” dis- 
tributions were calculated with the following simple approximation formula 
by Poisson 
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k 
P(k\= a e, expected value A constant. (9.9) 


Poisson looked at questions such as the “raindrop problem” in a more general 
manner, and showed that his formula approximates the binomial distribution 
for a large n, small p, and a constant A = np. He has thus saved us a lot of 
work in the calculation of similar examples. 


® Remark: P(0) = e*, P(1) = Ae, P(2) = aes etc. are special cases. Using 
Formula (8.5), it is easy to see that the sum of all probabilities P(k) equals 


SAP Xr pene A. 3A 0 
2, 7 ree eae = ®@ 
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>>> Application: color-blind 

Tests have shown that every one hundredth person is color-blind. What is 
the probability that there are at least two color-blind people among 100 
randomly selected persons? How many people have to be tested to find at 
least one color-blind person with a probability of no less than 95%? 


Solution: 

The expected value is \ = 100(1/100) = 1. The opposite of “at least 2” is 
“Q or 1”. We have P(0) + P(1) = e1(1+4+ 1) » 0.74, and therefore, P(> 2) » 
1-0.74 = 0.26. 

The opposite of “at least one” is “none”. The complementary probability of 
P(0) < 0.05 is P(> 0) > 0.95. For n tested people, the expected value is n/100 
and P(0) = e-"/1 < 0.05 has to hold. Therefore, we have —n/100 < In 0.05 
and n > 300. da< 


>>> Application: significant increase? 

A person is investigating a report from the Internet: “Strange accumulation of 
suicides in Aargau (Switzerland) since the beginning of the year. In January 
and February, 18 men and 5 women committed suicide in Aargau. In light of 
these numbers, even the Cantonal police of Aargau begin to wonder what is 
happening. In the year 2001, the statistics for the Canton of Aargau records 
95 suicides. On the 20th of February alone, 4 people committed suicide.” 


Solution: 

For 2001, we have the expected value = 95/6 = 15.83 for 2 months. For 
k suicides in 2 months, we get p(k) = \*/k!-e-**. The probability that a 
maximum of 22 suicides occurs per week is 1329 p(k) = 0.947. Therefore, 23 
or more suicides are expected with a probability of slightly more than 5%. 
For a single day, A = 95/365 = 0.26. The fact that four or more people commit 
suicide in one day is actually very extraordinary, because p(< 4) = Dp, p(k) = 
0.99984. Nevertheless, the observation period extends over two months, and 
therefore, we have a probability of 1 — p(< 4)® ~ 0.001 for such an extreme 
accumulation. 
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® Remark: The latter was probably the trigger for the alarming report. 2001 could 
have been a year with fewer suicides than usual. If the number of total suicides 
considered to be “normal” had been, for example, 110 and only two people had 
taken their lives on the 20th of February, then 21 = 23 - 2 suicides would not have 
been considered extraordinary. 
In Germany, the number of annual suicides fell below 10,000 for the first time in the 
year 2006. There is no evidence that days of full moon or weekends lead to higher 
suicide rates. Only the seasons have an influence (fewer suicides are committed 
during summer and autumn). This ties in with the above report from Switzerland. 
The ratio of male to female is not uncommon either. In Germany, it is almost 3:1. @ 
ddd 


The central limit theorem 


>>> Application: rolling many dice 

In Application p. 456, we answered the question: “Why does the sum 10 
appear more often than the sum 9 when rolling three dice?” Let us now 
clarify the question in a different way: If we throw a die, it is equally probable 
that it will show 1, 2, 3, and so on. The average number of eyes will be 
(1+2+...+6)/6 = 3.5. If we throw three dice, the average number of eyes will 
approach 10.5 in many experiments. The number 10 is closer to this average 
value than 9. We will soon see that numbers closer to the expected value also 
occur more frequently. <<< 


Let us imagine an “electronic die” that produces integers ranging from 1 and 
6 (each with p = 1/6). If we roll n dice simultaneously, then a maximum of 
u=n-p of 6 is to be expected. 
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6 dice 
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Fig. 9.37 simultaneously throwing sever- Fig. 9.38 ...and the number of rolled 6 
al “electronic dice” ... 
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If the number of dice is increased, the probability distribution will begin to 
change into a bell-shaped curve. The curve becomes steeper the more dice are 
used. In other words, the “outliers” remain even closer to the expected value 
jt when n increases. The area under the curve remains constant, because 
all probabilities sum up to 1. Without proving this, we formulate relatively 
generously: 
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Fig. 9.39 We raise the number of dice... Fig. 9.40... and reach a bell curve. 


Central limit theorem: The sum of n identically distributed, independent random 
variables X obeys for a large n a “normal distribution’, i.e. the Gaussian bell 
curve (expected value = n- jz with variance no2). This is especially true 
for the binomial distribution, which indicates how frequently an event with the 
probability p occurs at an n-fold execution. Here, we obtain p = np; 07 = 
np(1 — p). 


® Remark: We have already discussed the prototypes of the curve (f= 0, ¢ = 1) in 
Formula (8.1). In general, the curve has its maximum at jy and the inflection points 
are at y+o. The shape of the curve obviously depends on n and p. ® 

In practice, the following addition is important: 


For a normal distribution, approximately 2/3 of all occurring values are in the 
interval y+ a. 95% are in the interval y+ 2c. 99.7% of all values are in the 
interval js + 30. 


a © 


Fig. 9.41 different variances 


Fig. 9.41 shows bell curves that bulge in different manners. What they all 
have in common is that the area underneath the curve equals 1, namely the 
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total sum of all probabilities. The area between —co and z is given by 


pa mp 
Ps) = [ eC)” de. 


oV 2 


This integral cannot be evaluated in terms of elementary functions, but it 
can be evaluated numerically, as we have seen in Application p. 417. 


>>> Application: quality control (Fig. 9.42) 

A light bulb manufacturer claims that a maximum of 2% of the delivered 
goods are defective. In a test of 100 bulbs, 5 were defective. Is this contra- 
dictory to the factory specification? 


Solution: 

We have p = 0.98 and n = 100. Thus, = 98 and 0? = w(1-p)* 250814. 
The measured value 95 deviates by 3 > 20 from the expected value. This 
means that the factory data is not correct, as we get a probability of more 
than 95%. <<< 


>>» Application: systematic overbooking (Fig. 9.43) 

Aircraft are usually overbooked because experience has shown that not all 
passengers board the plane. An airline generally overbooks a small aircraft 
by 1 seat and larger ones by 3 seats because 5% of all booked passengers 
usually do not appear. What is the probability that there will be a shortage 
of seats with a seating capacity of 50 or 200? 


Solution: 

On the small plane, n = 51 and p = 0.95 result in pp = n-p = 48.45, 0? = 
48.45 -0.05 => o » 1.56. The maximal possible value is 50 ¥ +o. 68% of all 
values lie within +o. 16% still lie above 4+ 0 for symmetry reasons. The 
probability of a shortage is, therefore, relatively high. 

For larger aircraft, with n = 203, w x 193, and o » 3.1, we are already outside 
of the 20-range and on the safe side with 201 passengers. <<< 
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>>> Application: minimum sample size 

How many people must be tested so that there is a probability of 95% that 
the proportion of left-handed people can be determined with a precision of 
1%? 

Solution: 

Let p be the actual probability of left-handed people. If we examine n peo- 
ple, the sample has the expected value yp = np, and the distribution is 
a=\/np(1-p). 

Let S, be the number of left-handers and p, = S/n be the determined 
relative frequency of left-handers in the sample. Now, we require a deviation 
of less than one percent: |py — p| < 0.01. 


In order to remain in the 95% confidence interval, the following occurs: 
[Sn — ft] < 20 = [pn - p| < 2Vp(1 - p)/n. 


Therefore, we have 0.01 > 2\/p(1 - p)/n. After first knowing nothing about p, 
we have to replace p(1-p) by its maximum 1/4 to be safe: 0.01 > 2\/1/(4n). 
This results in n > 10,000. 

Now, however, we may know from smaller preliminary investigations that p 
is relatively small, namely about 0.1. This shrinks p(1—p), and we round to 
get n = 3,600. <<< 


>>> Application: uncertain election forecasts 
For a large party, we want to make a reliable prediction of the result with a 
precision of 1%. What is the size of the sample? 


Solution: 

According to the results of the previous application, we need 10, 000 inquiries. 
For a large party, p(1-p) is not much smaller than 0.25, so that there is no 
hope for limitation. Neither the selection of the people nor their honesty has 
yet been taken into account. 

If, as before, only 400-500 people are questioned by different opinion research 
institutes, only accuracies off by plus or minus a few percentage points can 
be expected. Often enough, however, it is only by a few percentage points, 
sometimes even by a mere few thousand votes, that a candidate gets elected. 
This shows that such surveys are not meaningful, and their results are stron- 
gly influenced by what political party stands closest to the research institute 
carrying out the survey. <<< 
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9.8 Further applications 


>>> Application: /t depends on the seed. 

In the computer age, many computations would be unthinkable without 
computer-generated random numbers. The essential task is the creation of 
a single random number, for example, by using the system time. Then, the 
remaining numbers are recursively arithmetically derived from this seed (“de- 
terministically”). This has the advantage that it helps us to understand the 
scientific knowledge that can be gained with such “pseudo-random numbers”. 


@® Remark: For programmers: A discussion of different algorithms — partially with 
source code of the C functions srand () and rand () — can be found at http:// 
www.codeproject.com/KB/cpp/PRNG.aspx. The mentioned functions produce uni- 
formly distributed 16-bit random variables. ® <<< 


>>> Application: How random are the digits of x and e? 

The circle number a and Euler’s number e are known for the fact that the 
order of their decimal places is not subject to any law (the numbers can, 
however, through recursion be calculated arbitrarily precisely). Fig. 9.44 and 
Fig. 9.45 show the distribution of digits in the two numbers for 100 and 


100,000 decimal places.? 
0 1 2 3 4 5 6 7 8 9 


Fig. 9.44 number of digits in 7 (dark Fig. 9.45 number of digits in e (dark 
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frequency in % 
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Even if it does not appear to be so “initially” — especially for Euler’s number, 
the number 7 appears to be more frequent at first — the numerical frequency 
converges rapidly, and it is already from 100,000 digits onward that an almost 
uniform distribution is achieved. 


® Remark: For a random sequence of digits, it is not sufficient that all digits occur 
equally frequently. Another possible investigation uses the so-called maximum test: 


3The files are from http: //www.arndt-bruenner .de/mathe/mathekurse. htm 
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The digits are grouped into triples, and it is checked how often the middle digit 
is larger than its left and right neighbour. If the middle digit is, for example, 7, 
then there are 7? - 7? possibilities for the other two digits. Among the 10-10-10 
possibilities of triple digits, 1? +2? +...+ 9? = 285 is the maximal middle digit. It 
is, therefore, possible to require that such a maximum occurs with the probability 
» 0.285 in the case of very long random sequences. In fact, for the digits of 7 and 
e, the values are 0.288 and 0.283 respectively at 100,000 digits — another indication 
of randomness. ® <<< 


>>> Application: uniform point distribution on the sphere (Fig. 9.46) 
It sounds easier than it is: How do you “dot” a polygon or a sphere randomly 
but evenly? 


Solution: 

On the polygon: First, for a point R, one must be able to determine whether 
or not it is inside the polygon P; (2 = 1,...,n). If the polygon is convex, 
one can, for example, use this test: Calculate the sum of all signed angles 
2P;RP,,, by means of Formula (5.36). If it results in 360°, the point lies 
inside (for points outside, the sum is 0). If we want to reach a given point 
density, the number of points must be proportional to the polygon area. To 
calculate the area, we triangulate the polygon and calculate the area of each 
triangle using Formula (5.44). Now, we look at the rectangle (picture on 
the left) circumscribing the polygon and select random points R(«/y) with 
random coordinates x and y in the rectangle area until the number of strikes 
in the polygon corresponding to the area is reached. 


Fig. 9.46 Distribution of random points (from left to right): polygon, sphere (bad 
because it is dependent on the parametrization), sphere (good because it is uniform), 
sphere (shaded), torus 


On the sphere: The parametrization % = (r cosucosu, rcosvusinu, 7 sinv) 
with 0 < u< 2m, -2/2 < v < 2/2 provides an arbitrary division into quadri- 
laterals or triangles of different sizes. Now, we only have to fill each polygon 
as described (third image from the left in Fig. 9.46). 

It is unfavourable to choose the parameters u and v randomly in their inter- 
vals, as it was done in the second picture from the left, because this results in 
an accumulation of points in the polar regions. If the incidence angle of the 
light also enters into the calculation of the number of points, then “shaded” 
images are obtained. This can be applied to all polygonal surfaces (rightmost 
image). 
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Fig. 9.47 application of uniform distribution of points on the sphere in technology 
and in nature 


® Remark: The even distribution of points on a sphere is also known under the 
label Thompson’s problem, which is concerned with minimizing the energy of n 
equal point charges on the sphere. For more information, see the book “Geometry 
and its Applications”. ® <<< 


>>> Application: male or female? 

The sexual proportion is defined as the ratio of the male portion of a certain 
age group to the female portion of the same age group. At the time of birth, 
it fluctuates between 1.02 and 1.12 (Fig. 9.48). Values that go above 1.10 are 
reached only by gender-specific abortions. The values in Europe are between 
1.05 and 1.07. We prefer mathematical notations like 1.05, because we must 
calculate to more decimal places. Statistics often contain information such 
as 1,050 (:1,000) or 105 (:100). In the following table,* P(m) denotes the 
probability of a boy’s birth: 


sexual- P(m) country births / sexual- P(m) country births / 
prop. thousand prop. thousand 
1.02:1 0.505 South Africa 18.30 1.06:1 0.515 Germany 8.25 
Greenland 15.93 Denmark 11.13 
Kenya 39.72 Sweden 10.27 
1.03:1 0.507 Nigeria 40.43 Czech Republic 9.02 
Mali 49.82 Greece 9.68 
Columbia 20.48 Cuba 11.89 
1.04:1 0.510 Finland 10.45 Bangladesh 29.80 
New Zealand 13.76 1.07:1 0.517 — Italy 8.72 
Laos 35.49 Spain 10.06 
1.05:1 0.512 Austria 8.74 Slovenia 8.98 
Switzerland 9.71 Bosnia 8.77 
Norway 11.46 Ukraine 8.82 
France 11.99 Vietnam 16.86 
USA 14.14 Malaysia 22.86 
Indonesia 20.34 1.08:1 0.519 South Korea 10.00 
Brazil 16.56 Venezuela 18.71 
India 22.01 China until 1980 - 
Iran 17.00 Singapore 9.34 
Japan 9.37 1.10:1 0.524 Taiwan 12.56 
North Korea 15.54 1.12:1 0.528 China 13.25 
Argentina 16.73 =1.06:1 0.515 Worldwide 20.05 
Fig. 9.48 sexual proportions of different countries << 


“Data downloaded from www.cia.gov/publications/factbooks/geos/ in 2010. 
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>>> Application: classification by birth order 

We want to give an invulnerable breakdown of sexual proportions for the 
first birth, second birth, third birth, and so on. Here, data from the USA 
was used (Fig. 9.49): Firstly, the US has a sex ratio that is typical for the 
world population. Secondly, the data is statistically secure because of their 
reliability and large numbers (about 2 to 4 million births per year). After 
all, there are 100,000 births of a 5th child per year. The data was extracted 
from huge files of the National Bureau of Economic Research.” 

In the US, there were approximately 50 million births in the period 1992- 
2003. It can be seen from the table that certain circumstances are fairly 
stable during this period. These circumstances include the average sexual 
proportion (1.05), the high number of data, and the number of children per 
mother. 


year 1. birth 2. birth 3.birth 4. birth 5. birth sum per 
1.-5. birth mother 
991 1.053 1.043 1.041 1.033 1.035 4.02 mio. 1.96 
992 1.054 1.051 1.046 1.040 1.040 3.97 mio. 1.96 
993 1.054 1.049 1.048 1.038 1.033 3.91 mio. 1.95 
994 1.052 1.047 1.045 1.039 1.038 3.86 mio. 1.94 
995 1.056 1.047 1.044 1.040 1.035 3.80 mio. 1.93 
996 1.050 1.047 1.043 1.043 1.036 3.80 mio. 1.94 
997 1.052 1.049 1.041 1.038 1.040 3.79 mio. 1.95 
998 1.050 1.048 1.044 1.037 1.041 3.85 mio. 1.96 
999 1.053 1.049 1.043 1.041 1.035 3.87 mio. 1.96 
2000 =1.054 1.048 1.045 1.038 1.030 3.97 mio. 1.96 
2001 =1.051 1.045 1.038 1.040 1.044 3.94 mio. 1.97 
2002 =: 1.055 1.048 1.040 1.040 1.031 3.94 mio. 1.97 
2003 = 1.054 1.047 1.049 1.037 1.031 3.94 mio. 1.97 


Fig. 9.49 sexual proportions in individual births (USA) 


Fig. 9.50 illustrates the table graphically. In purely visual terms, one can 
draw quite relevant conclusions. The plotted graphs are smoothed so that all 
values for each single birth remain in a simple confidence interval. At the 4th 
birth, and even more at the 5th birth, these intervals become larger due to 
lower birth rates. If you allow a confidence interval of 95%, then the curves 
appears almost linear. 

From this very reliable data, the following conclusion can be drawn, at least 
for the USA during the period of 1991-2003: 

The sexual proportion drops slightly but significantly from birth to birth. The 
average sexual proportion agrees fairly closely with the sexual proportion at 
the second birth. 

Of course, the latter applies only if a mother has 2 children on average. If the 
number of children per woman is limited by law to 1, the sexual proportion 
might be correspondingly higher. However, if the sexual proportion is slightly 


Shttp://www.nber.org/data/vital-statistics-natality-data.html 
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Fig. 9.50 sexual proportions in different births (USA, 1991-2003) 


dependent on the age of the mother and mothers with single children are 
slightly older when they give birth, then the proportion does not increase even 
with single children. Corresponding comparative values would have to cover 
huge samples and those would probably come from China. There, however, 
the data is falsified because more female than male foetuses are aborted. «<< 


>>> Application: life expectancy 

Let s(x) be the proportion of the population of a country that died at the 
age of x years, and f(x) be the portion that died at the age of at least x 
years (s(x), f(x) €[0, 1]). Considering these functions, one shows that the 
average life expectancy of a newborn is 


f- f(x) dx 


years (here, m * 100 is the statistically relevant maximum age of a person, 
compare with Application p. 287). 


Solution: 

Let s; be the proportion of people who — for instance, in the past three years 
— died at the age of i years. Since each person is assigned exactly one age, 
9 Si = 1. Further, according to the definition of f(x), 


m 

oo si = f(z). 

wz 
Obviously, f(0) = 1, f(m) =0, and f(x) is a monotonically decreasing func- 
tion converging to zero, which is, however, almost constant at first and de- 
creases rapidly “towards the end”. The functions s(a) and f(x) are, strictly 
speaking, “step functions” (“bar diagrams” with a bar width of 1). The area 
under these step functions can be calculated as the sum of the rectangular 
areas (rectangle width is 1) or, as with any other integrable function, as a 
definite integral: 


A= ri -l= [f@ dx (i integer, x real). 
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Fig. 9.51 death statistics and ie ae in hasan (www. statistik. at) 


@ Remark: As is clearly shown in Fig. 9.51, mortality is comparatively high in the 
first year of life (especially at birth). Then, it remains constantly low until the young 
become victims of road accidents. From about 35 and onwards, the mortality begins 
to rise slowly and reaches its peak at being just over 80. ® 


The average life expectancy e of a newborn can now be calculated as follows: 
e=0-spt+1-5,4+2-sgt--= 


=(s1+59+53+...)+(s9+53+...)+(sgt+s5qt...)+...= 
—— _.—_ 
(0) f() f(2) 


=O) +F) +I) += D=A= f" fa)ae: 


Thus, the life expectancy of a newborn can be interpreted as the area under- 
neath the “life step”. «<< 


>>> Application: remaining life expectancy 
As before, let the expression f(a) be the proportion of the population of a 
country that died at an age older than x years. Show that a person who is 
already a years old can expect an average of 


1 m 
éla)= Fla) [ f(x) dx 


remaining years to live and thus reach an average age of a+ e(a) years. 


Proof: For a = 0 (f(0) = 1), we have already proven the formula. If a person has 
already reached a certain age a, the person may already have survived the early 
death of other people and “pushed” the average age of death. This increases the life 
expectancy. 

This time, we restrict the group of people to those who are older than the person 
in question. Their proportion is defined by f(a). Further, let s; be the proportion 
of people who died at just i years old. If we now set # = x—a, then our person with 
& = 0 is, so to speak, a “newborn” with a remaining life expectancy of 
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Fig. 9.52 remaining life expectancy 


Thus, the remaining life expectancy can be interpreted as the remaining area under 
the “life step” multiplied by the magnification factor fia 

For the individual, a calculation is, of course, of little significance. However, when 
it comes to pension calculations or retirement benefits, such values are of enormous 
importance. 

Numerical example: Fig. 9.51 shows the graphs of the functions s(x), f(x), and 
x+e(a) for Austrian males in 1992. The average life expectancy of a newborn was 
73 years. A 60-year-old man could expect to become 78 — 79 years old. An 80-year- 
old could statistically even reach the age of 87 years. Only ten years later, all values 
increased by about 2 years. Remarkably, life expectancy for women is much higher 
(about 6 years). © <<< 


>>> Application: estimation by means of point clouds (Fig. 9.53, Fig. 9.54) 
In Application p. 292, we derived a formula for the linear fit which can be used 
to approximate an entire point cloud. Whether this is useful or not depends 
on how strongly or weakly the point cloud is “scattered”. Furthermore, the 
meaningfulness of the linear fit depends on whether the data (that is, the 
coordinates of the individual points of the cloud) are actually in a linear 
relation. 

Fig. 9.54 provides a typical example of where a linear fit is unsuitable, but 
a cubic fit gives remarkably good results. It concerns the (obvious) relation 
between body size and body mass or weight. In Chapter 2, we discussed this 
relation in detail (see, for example, Application p. 105). In order to prove 
the cubic relation (doubling the size leads to an eightfold increase in weight), 
50 male and 50 female adults (students and staff at the author’s university) 
were “measured” (Fig. 9.53). 

From these 100 measurements, the linear fit and the cubic fit were calculated. 
The linear fit (shown in red) is practically unusable outside the range of a 
150-200 cm body size. 


® Remark: The cubic parabola (shown in green) allows for a good extrapolation 
even far outside the interval. All the markings recorded in Fig. 9.53 that lie outside 
the measuring range are additional test values that were not taken into account for 
the calculation. The values denote the following data: average values at birth (1), 
daughter of a random test person (2), tallest living German woman (3), tallest living 
German man (4), tallest woman ever (5), and tallest human ever (6). Apparently, 
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Fig. 9.53 100 measured va- Fig. 9.54 possible extrapolations using linear fit 
lues (body size, body mass) or “cubic fit” (useful!) 


the tallest woman ever was very overweight. The tallest German woman weighed 
only 68 kg at a body size of 206 cm, and was, thus, significantly underweight. ® 
ddd 


>>> Application: “heart rate variability” for life prolongation 

Modern medicine has found that the intervals between individual heartbeats 
are not constant. The frequency of the heartbeat is determined by the inter- 
play between the two vegetative nervous systems — the sympathetic system 
and the parasympathetic system. The sympathetic system keeps the heart 
on its toes or rushes it with stress, danger, sports, etc. The parasympathe- 
tic system, which may be regarded as the former’s opponent, tries to slow 
the heart — for example, during the intake of food or while sleeping. This 
permanent interplay produces a “non-linear behaviour” of the heart, which 
is quite healthy for the body, because the wear of the heart is, thereby, 
uniform. Markus Mooslechner presents the seemingly chaotic structure of 
human heartbeats — as a strange attractor — in a remarkably visual way 
(http://www.humanchaos.net/): In the myriad of measured data (time diffe- 
rences between heartbeats, which are derived from commercially available hardware 
used for running), each of the three successive values are interpreted as the coordi- 
nates of a point in space. The points lie on the space curves which are smoothed by 
means of spline interpolation (Application p. 290). In order to quantify the measure 
of the instantaneous deviation, a sphere with the deviation of the x-value from the 
average of all deviations is placed around the point. The interpolated envelope of 
all these spheres results in strands that vary in thickness. 


® Remark: With appropriate knowledge, one can read out a lot about the heart rate 
variability of the tested person. ® <<< 


>>> Application: unreliable extrapolation 
You have to be very cautious with extrapolations. In the marshes of Los 
Angeles, there are lot more carnivores than herbivores. Large mammals that 
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Fig. 9.55 strange attractor Fig. 9.56 Too many carnivores? 


would slowly sink into the swamps were clearly easy prey for a multitude 
of robbers, but swampy environment could just as well become fatal for the 
latter. daa 


>>> Application: Evolution is design by selection. 

Albert Einstein reportedly said: “God does not play dice”. Torn from the 
context of quantum mechanics, this quote is often cited by “creationists” to 
support their belief that man as the crown of creation could not have evolved 
by chance. The problem is that the process of evolution is on-going and 
never complete: Evolution is not “design by chance” but “design by choice”. 
Accordingly, all creatures, as they are now present on the Earth, are not 
mere random products, nor are they the result of an “intelligent design”, but 
rather they are the result of a millennial — and incomplete — process based 
on a design principle defined by random mutations. 

Among the innumerable random mutations (plants preferentially continue 
as they are), only those that offer a real advantage to their species will 
eventually assert themselves. The principle is as ingenious as it is simple: 
It produces living creatures — each one of its own niche — that are almost 
unsurpassed and extremely complex. A simple and very illustrative example 
is the arrangement of the seeds in a sunflower (phyllotaxis, see p. 531). «<< 


Fig. 9.57 Luckily, what is depicted here Fig. 9.58 the “God does not play dice” 
is only a firework. misunderstanding 
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>>> Application: Murphy’s law 

Murphy’s law is a worldy wisdom that goes back to Edward A. Murphy 
Jr., which makes a statement about human error or the sources of error in 
complicated systems. It is best known as: “If something can go wrong, then 
it will go wrong.” Murphy’s original formulation was: “If there are two or 
more ways to do something, and one of them can end in a catastrophe, then 
someone will choose that sort of thing.” There are dozens of variations of 
this statement on the Internet, and you can spend hours searching in order 
to amuse yourself. 

The latter is probably due to the fact that each of us has already experienced — 
in her/his own life or in her/his immediate surroundings — that things have gone 
wrong which, according to the subjective concept of probability, should never have 
gone wrong. 

The mathematician will have less trouble understanding the matter. Let us take the 
example of a nuclear power station. We have already experienced a core meltdown 
during the Chernobyl reactor accident in 1986. Up until then, such an accident had 
been considered impossible. In such a complex environment, many events depend 
on each other. The connections are not so easy to see. One can say: The probability 
of all safety precautions being simultaneously suspended is zero. If, however, a per- 
son working under stress makes a mistake at one o’clock in the morning, this can 
cause a chain reaction in the truest sense of the word. The corresponding passa- 
ge downloaded from http: //www.umweltinstitut.org/frames/all/m226.htm in 
2010 (compare Grigori Medvedev: The Truth about Chernobyl, Tauris Publishers, 
1991) reads like a crime novel and is somehow reminiscent of the film showing the 
downfall of the supposedly unsinkable Titanic: 

“In order to allow the experiment (the breakdown test) to take place under realistic 
conditions, the emergency program, the so-called “reactor protection system”, was 
switched off .... Yet, since the start of the experiment was postponed, the unpre- 
pared staff working the night shift of April 26 had to take over the experiment, for 
which arrangements had been made that practically rendered the reactor unsafe. 
At the beginning of the experiment, the reactor performance was greatly reduced 
by an operating error of the inexperienced reactor operator Leonid Toptunow. To 
raise them again, the operators removed the brake rods .... Nevertheless, deputy 
chief engineer of the power plant, Anatolij Djatlow, ordered the beginning of the ex- 
periment. The operators switched on too many cooling pumps, so that the reactor, 
which was working with little power, could no longer evaporate the water flowing 
around it. The water began to boil, and the first hydraulic blows were heard. Aki- 
mov, the shift leader, and Toptunov wanted to end the test, but Djatlow ordered 
them to continue. He spoke the historical words: “Just one or two minutes, and it 
will be over! Somewhat more flexibility, gentlemen!” The time was 1:22:30.” «<< 


A Music and mathematics 


This appendix covers a topic that is underestimated in its significance by 
many mathematicians, who do not realize that essential aspects of music 
are based on applied mathematics. This short treatise is meant to show the 
fundamentals. It may even pique interest in the laws that govern the sen- 
suous beauty of music. Additional literature is provided for readers who are 
particularly interested in this topic. 

The mathematical proportions of a sounding string were first defined by Py- 
thagoras, who made his discoveries by subdividing tense strings. This chapter 
describes changes in scales and tonal systems over the course of musical his- 
tory. The author of this appendix is Reinhard Amon, a musician and the 
author of a lexicon on major-minor harmonic systems. 

The chapter will conclude with exercises in “musical calculation” that show 
the proximity between music and mathematics. 


© Springer International Publishing AG 2017 
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A.1 Basic approach, fundamentals of natural 
science 


The basic building blocks of music are rhythmic structures, certain pitches, 
and their distances (= intervals). Certain intervals are preferred by average 
listeners, such as the octave and the fifth. Others tend to be rejected or ex- 
cluded. If one makes an experimentally-scientific attempt to create a system 
of such preferred and rejected intervals, one will find a broad correspondence 
between the preferences of the human ear and the overtone series, which fol- 
lows from natural laws. Thus, there exists a broad correspondence between 
quantity (mathematical proportion) and quality (human value judgement). 
Experiments of this kind have been conducted since the times of Hellenic 
philosophers — such as Pythagoras of Samos (* 570-480 BC), Archytas of 
Tarentum (* 380 BC), Aristoxenus (* 330 BC), Eratosthenes (3rd century 
BC), Didymus (* 30 BC), and Ptolemy (* 150), among others. 

Pythagoras founded the religious-political community of the Pythagorean 
school in Kroton. He was considered to be the incarnation of Apollo and was, 
even during his lifetime, revered as a god. 

During such experiments, a string was strung up on a monochord and sub- 
divided using a measurement scale according to the strict mathematical pro- 
portions 1 : 2, 2: 3 etc. These parts (their halves, thirds, quarters, etc.) 
were made to vibrate in relation to the whole. Assuming a base tone C, the 
following numbererd tone row is, thus, constructed. 

The emphasized numbers are “ekmelic tones”. 

These are prime numbers greater than five. Their exact pitch cannot be 
represented in the Western classical notation system, whose octaves are only 
subdivided into twelve separate steps. They are, in fact, a little bit lower 
than their notation. 
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Fig. A.1 tone row by the subdivision of a tense string 


This row corresponds to the overtone series or harmonic series,! which is a 
physical phenomenon that accompanies every sound process. The points that 
appears next to the numbers transform the distances between the tones into 


1Discovered in 1636 by Marin Mersenne. The associated mathematical rules were discovered in 
1702 by Joseph Sauveur. 
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1:2 | octave 5:8 minor sixth 15:16 | minor second | 2:3 fifth 

2:3 | fifth 5:9 minor seventh 8:9 major second | 5:8 minor sixth 
3:4 | fourth 8:9 major second 5:6 minor third 3:5 major sixth 
3:5 | major sixth | 8:15 major seventh |} 4:5 major third 5:9 minor seventh 
4:5 | major third | 15:16 | minor second 3:4 fourth 8:15 | major seventh 
5:6 | minor third | 32:45 | tritone 32:45 | tritone 1:2 octave 


TABLE A.1 intervals sorted by sound consolidation and size 


proportions (C - c octave = 1:2; c - g fifth = 2:3; etc.). The overtones that 
sound above the base tone (the lowest tone of the row — the one which is 
notated in the score) are not perceived by us as precisely discernible partial 
tones, but as “characteristic sounds”. This phenomenon occurs because the 
strings or columns of air do not only oscillate as a whole, but also in halves, 
thirds, quarters, .... Thus, strings or columns of air do not only produce a 
single tone, but a sum of tones that produces the characteristic sound. The 
most important components of Western music — the intervals — can be des- 
cribed through mathematical proportions of the length of oscillating strings. 
This is also true for the oscillating columns of air in woodwind instruments. 
What is more, this row of pitches following from natural laws contains all 
important intervals for major-minor tonality. 

Since modern times, numbers have been used differently in relation to tones — 
the basis for the description of a pitch now being the frequency, and no longer 
the length of a string. Here, it is important to understand that frequency 
is inversely proportional to the length of a sounding string. A ratio of 1:2 
with respect to string lengths corresponds to a ratio of 2:1 with respect to 
frequency, as a string or column of air that is shortened by half oscillates at 
twice its original frequency. Twelve essential intervals should be given, from 
consonant intervals (those with a high degree of sound consolidation among 
both frequencies — from the octave to the minor sixth) to dissonant intervals 
(those with little or no sound consolidation among both frequencies). 

® Remark: The discovery of the connection between the sensual impression of music 
and its mathematical fundamentals is described in a story by Pythagoras (Fig. A.2): 


octave 


tempered whale tone 
8:9 


6 8 9 12 
Fig. A.2 Pythagoras’s discovery Fig. A.3 ordering of tonal relationships 


“One day, during a long walk, Pythagoras pondered the problem of consonance. He 
wondered whether he would be able to find an aid for his sense of hearing, analogous 
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to the yardstick or the compass as an aid for the visual sense and to the scale as an 
aid for the sense of touch. 

Through a fortunate twist of fate, he passed by a smithy and could clearly hear 
that several iron hammers striking against anvil, with the exception of one pair of 
hammers, produced consonant sounds. Full of joy, as if a god supported his plan, 
he entered the smithy and realized through various experiments that it was the 
varying weight of the hammers, rather than the blacksmith’s output of energy or the 
strength of the hammers, that caused the tonal difference. He carefully established 
the weight of the hammers and their impact, and then returned home. There, he 
fixed a single nail at a corner of the wall, so that no two nails, each having its specific 
substance, would falsify the experiment. On the nail, he hung four strings of the 
same substance, with the same number of wires, the same thickness, and the same 
rotation, and he attached a weight at the utmost end of each string. All strings had 
the same length, and by plucking two strings simultaneously, he managed to create 
the consonances he was looking for, which varied with each pair of strings. With the 
two weights 12 and 6, he produced an octave and thus determined that the octave 
is in a ratio of 2 : 1 — something which he had already realized with the weight of 
the hammers hitting the anvil. With both weights 12 and 8, he received the fifth 
from which he derived the ratio 3: 2. With the weights 12 and 9, he got the fourth 
and thus the ratio 4: 3. When comparing the middle weights 9 and 8, he recognized 
the interval of the whole tone, which he, then, defined as 9: 8. Accordingly, he could 
define the octave as the union of fifth and fourth, that is, 2/1 = 3/2 x 4/3, and the 
whole tone as the difference of the two intervals, that is 9/8 = 3/2 x 3/4." ® 


The graphical representation (Fig. A.3) of this Pythagorean basic order of 
the tonal ratios shows that, simultaneously with the two fifths (2:3), two 
quarter intervals (3:4) and a whole tone interval (8:9) are produced. 

These musical and mathematical rudiments discovered by Pythagoras and his 
pupils were regarded not as a “primary substance” but as a “primary law”, as 
it was believed that the the creation of the world and the cosmos was based on 
invariable numerical proportions. The prevalence of this belief is reflected by 
the elevated status of mathematics, which is still widely considered as the first 
of the sciences. This is also due to the influence on a person’s mental balance 
that is often ascribed to mathematics, as, for instance, in the following quote 
by the philosopher and mathematician Wilhelm Leibniz (1646-1716): “Music 
is an unconscious exercise in mathematics in which the mind is unaware that 
it is dealing with numbers”. 


2 J. Chailley: Harmonie und Kontrapunkt, p. 7ff. 
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A.2 System formation 


Since ancient times, there have been attempts to derive scales and sound 
systems from the existing tones. To this end, people tried to fill the octave- 
space with intervals that were preferably of equal size and of harmonious 
proportions. 

® Remark: The octave — the complete fusion of the two frequencies involved (octave 
identity) — has a special position among the intervals, as it represents the classifica- 
tion framework for all music systems. It can thus be said to function analogously to 
the different floors of a building, with each floor having the same infrastructure. ® 


Important for the formation of systems is the overtone series — primarily, 

the fourth octave from the 8th to the 16th overtone — represented by the 

natural wind instruments. Since the material selected for system formation 

is determined not by the fundamental sciences of matter, but by a person, 

who relies on their senses, two approaches can be formulated: 

e This musical approach is based on external factors (geographical location 
and environment, cultural conditioning, regional listening habits, etc.) and 
is accepted out of habit. 


e People have a predisposition to hearing (see A.5) — that is, the preference 
or exclusion of certain intervals is natural. 


The most important sound systems (the octave space is divided into 5, 6, 7, 
or 12 spacings) are: 


e pentatonic (5 gaps: 3 large seconds and 2 minor thirds), 
e whole tone scale (6 gaps: 6 whole-tone steps = large seconds), 


e pentatonics (7 gaps: 5 large and 2 small seconds (also: diatonic) > church 
mode major, minor), 


e chromatic (12 gaps: 12 semitone steps > major / minor extension, dode- 
caphony). 


® Remark: The half-tone step is the smallest interval in the Western sound system 
and forms the “basic building block” for a definition offering a different approach to 
intervals (major third = 4 half-tones, pure fifth = 7 half-tones, ...). ® 


All systems can be inserted into the above 12 chromatic-tempered spacings. 
The frequent occurrence of the number 12 (number of essential intervals, 
number of spacings in the system, ...), which is also reflected in the keyboard 
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Fig. A.4 piano keyboard 


of piano instruments (as a visual illustration of our sound system), thus 
attains essential importance. 

If we wish to start playing a major or minor scale, each of which consist of 
7 tones, starting on each of the white notes, all 12 keys are required. This 
means that each diatonal heptatonic (major and minor) is integrated into 
the chromatic-enharmonic duodecatonic. 


® Remark: The term ‘enharmonic’ denotes the possibility of changing several tonal 
values at one place in the tonal system (e.g.: f = ef = gbb, ...). ® 


Two different types can be distinguished: 


e Enharmonic duodecatonic: 35 diatonic-chromatic tonal values (each of the 
7 tones modified by bb, b, 4, {, x) result in 12 places (c = hf = dbp) in the 
system — and thus, in the most extensive references to a key. 


e Abstract tempered duodecatonic: there are only these 12 places (abstract 
tempered tonal values) — there can be no enharmonic confusion — without 
reference to a key. 


Here, too, the integers provide a decisive ordering factor. 
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A.3 Tuning instruments — intonation 


Once the number of notes in the tuning system is fixed, the problem of tuning 
(i.e., the exact position of these places — also intonation) becomes relevant. 


® Remark: Intonation (lat. attunement) refers here to the meeting of the correct 
pitch by instruments with flexible tones (strings, singers, wind instruments). ® 
Deviations from pure intervals, which are necessary for musical practice, are 
regulated through temperament control. The basic difficulty in calculating tu- 
ning systems is based on the mathematical phenomenon that integers cannot 
be obtained as powers of fractions. Since all pure intervals appear as mathe- 
matical fractions, a higher octave of an initial tone can never be achieved by 
the sequencing (= multiplication) of equidistant (pure) intervals (except the 
octave). 

Due to these differences, it was and continues to be necessary to set certain 
preferences in the selection of the relevant interval sizes for the system. As a 
result, the respective structure of the music and the resulting sound aesthe- 
tics have produced different approaches over the centuries of Western music 
development. 


Pythagorean tuning (tuning in fifths) 


Pythagorean tuning results from the attempt to calculate all the intervals 
using only the first four numbers. The system-forming interval becomes the 
fifth and thus the prime number 3. Twelve superimposed pure fifths in the 
ratio 2:3 comprise the tonal space of seven octaves. However, the 12th fifth 
does not agree exactly with the 7th octave. The difference (low, but clearly 
audible) between the 12th fifth and 7th octave is called the Pythagorean 


comma 
3 12 9) if 
(5) = 129.746 > (=) = 128. 
2 1 


Thus, the juxtaposition of 12 pure fifths does not result in a closed circle, 
but in a helix. 

The bright balls are at the positions of the octaves and close the circle. The 
dark spheres are at the positions of the fifths and form a spiral. The 12th 
fifth clearly deviates from the seventh octave. 

The octave 1:2 is represented by the prime number 2, the second 2:3 by the 
prime number 3. However, an exponentiation of 2 never coincides with an 
exponentiation of 3. The resulting microintervals are smaller than a semitone. 
Through tempering, we try to prevent this. 

Calculation of the system: 


initial point C tonal note | C 


string length | 1 


weloo | Ey 
win | @ 
wl |09 


NiR | oO 
AIRE] O 
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Fig. A.5 tunes: 7 octaves, 12 fifths 


D is obtained by moving two fifths t from C: 1 x 2/3 = 2/3; second fifth: 
2/3 x 2/3 = 4/9 or respectively the octave offset following |: 4/9 x 2 = 8/9. 

E (4th upper fifth, 2 octaves |) > (2/3x2/3x2/3x2/3) x4 = 16/81x4 = 64/81; 
A (8rd upper fifth, 1 octave |) => (2/3 x 2/3 x 2/3) x 2 = 8/27 x 2 = 16/27 
or by the Fifth D-A 8/9 x 2/3 = 16/27; B (5th upper fifth, 2 octaves |) 
=> (2/3 x 2/3 x 2/3 x 2/3 x 2/3) x 4 = 32/243 x 4 = 128/243. 

Since 1884 (A.J. Ellis), the octave is defined as 1200 cents: 

1Cent = °°/2 = 2!/1200 Therefore, all the cent numbers occurring in the text 
are rounded with the exception of the number 1200. 204 cents, for example, 


means a = 1.125058... % 2. The tempered semitone is then calculated 
as 100 cents (2100/1200 - 91/12 - 2/9), 
This produces the following Pythagorean scale: 


tonal note C} D E F G A B c 
cent values 0 | 204 | 408 | 498 | 702 | 906 | 1,110 | 1,200 
string length with 1 8 64 3 2 16 128 1 
respect to base tone 9 81 4 3 aT 243 2 


The series consists of five whole tone steps at a size of 204 cents and two 
semitones of 90 cents. This gives 5 full tones with 8/9 and 2 semitones with 
243/256. Pythagorean tuning was suited for monophony and thus for the the 
music of the Middle Ages. 

The slightly dissonant Pythagorean third 64/81 was, thus, no problem (the 
pure third has the proportion 64/80). As consonant intervals, only the octave, 
the fifth, and their complementary interval were recognized. With the advent 
of polyphony and the increased preference of merging (consonant) intervals, 
Pythagorean tuning with its dissonant major third lost its importance. 

The polyphonic music that was composed up until the 17th century coped 
with intonation difficulties in various ways. As a rule, musicians opted for 
the “pure” attunement of the most common sounds of musical practice, and 
they avoided or concealed unpleasant harmonies (for example, by means of 
ornaments). Another possibility was the use of more than 12 tones within the 
octave. Thus, keyboard instruments with up to 31 keys were created within 
the octave. 
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The pure tuning (natural-harmonic tuning, fifth-octave tuning) 


In a sense, our ears hear pure tuning. This occurs when each interval is 
intoned according to its tone ratio in the series of natural tones. The obvious 
conclusion is to apply pure tuning to both theory and music, because pure 
thirds and pure fifths exclude each other. The difficulty that is encountered 
here is that such a tuning requires far more than the maximum of 12 positions 
in the system. Depending on the key type, the modulations to be performed 
or the need for transposing, up to 171 positions are required. 

Didymos of Alexandria (#30 BC) divided the monochord strings into 5 equal 
parts and replaced the third 64/81 by 4/5. The deviation in the tones E, A, 
and B as the major thirds of C, F, and G is thus less: 


64: 81 = 0.79012345678 + 4:5=0.8 


The following scale is formed (the proportions are directly from the harmonic 
series): 


tonal note Cl! D E F G A B c 
cent values 0 | 204 | 386 | 498 | 702 | 884 | 1,088 | 1,200 
string length with 1 8 4 3 2 3 8 al 

9 5 4 3 5 15 2 


respect to base tone 


In pure tuning, two different whole-tone steps appear: the large whole tone 
8:9 between c-d with 204 cents and the small whole tone 9:10 between d-e 
with 182 cents. This difference causes, for example, an impure 5th between 
d and a. 

e:h=5/4 « 3/2 = 15/8 = 1.875 (1.5) 

dia= 9/8 «x 3/2= 27/16 = 1.687 (no r.5) 

The “difference” 80/81 (mathematically, it is a quotient) is called the syntonic 
comma. It is by the same value that the fourth a-d is too large. The difference 
between the large whole tone 9/8 and the small whole tone 10/9 (“difference” 
a , : fi = 81/80, the syntonic comma) is evident at many points in the 
system. For instance, at the duplicate tone e, we get the following as the 
octave third of c and as the compound twelfth (i.e. fifth) of c: 

The 5th overtone of C = E 5=E 10=E 20=E 40=E 80=E. 

The compound twelfth 1:3 of C > the third overtone G 3=G 9=D 27=A 
81=E. 

Duplicate tone 80/81. 

The increasing use of instruments with fixed pitches (organs, piano instru- 
ments, etc.) and the inclusion of keys with more and more signatures gave 
impetus to a solution that would eliminate these differences. The resulting 
tempered tunings finally made it possible to play music in all keys. 


Mid-tone temperament 


This term was coined by Arnold Schlick in 1511 and by Pietro Aron in 
1523/29. The term “mid-tone” is derived from the fact that the mean value 
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between 8/9 (large whole tone) and 9/10 (small whole tone) determines the 
new whole tone. The natural major third — which takes precedence over the 
fifth — is thus not divided as in the overtone series 8: 9: 10, but as a geometric 
mean. That is, it is divided into two equal whole-tone steps. 


tones 6.) b d e 
proportion + = MT 2 3 
cent O | 182 | 193 | 204 | 386 


The mid-tone scale is made in the following way (according to Arnold Schlick), 
with K standing for the Pythagorean comma: 


tonal note Cc D E F G A B c 

cent values 0 193 386 503 696.5 890 1,083 1,200 

string length with re- 1 | 8:5 4 3, Kk |2.K |3,K /] 8.K 1 
Paard 5 4% 4 | 3° 4 | 5% a | 15' 4 2 


spect to base tone 


Mid-tinted tones were used during the 19th century, when transposing was 
limited and there was no enharmony yet. This resulted in a clear tonality 
characteristic. 


® Remark: The term ’tonality characteristic’ means that each tonality has a more 
or less clearly distinct mood or expressive content due to its interval disposition 
(sharp, mild, serious, soft, ...). ® 

In order to make the “wolf tone” (an extremely impure fifth) between gy and 
di more bearable, one had to increase either the sound reserve (one key for 
gf and ab or for dj and eb) or the fifths at the expense of improving the pure 
thirds (= “non-equilibrating temperament”) 


Tempered tones 


There are two basic possibilities with only 12 places in the system. 

On the one hand, one can unevenly distribute the Pythagorean comma over 
thirds and fifths with 23.5 cents — these are uneven tones. On the other 
hand, one can evenly distributed over all the fifths — this is the equilibrium 
or equal temperament. The dissimilar tones were all calculated by Andreas 
Werckmeister (1645-1706), who was for a long time credited with the ‘inven- 
tion’ of constant temperament control. Johann Phillipp Kirnberger (1721- 
1783), a composer and music historian who had studied as a pupil of J.S. 
Bach from 1739-41, developed three temperaments (K1, K2, and K3). Kirn- 
berger’s temperaments became not only the most widespread, but they also 
asserted themselves among all irregular temperaments for the longest time. 


The C major scale with Kirnberger tones: 


tonal note Cl! D E F G A B c 
cent values 0 | 204 | 386 | 498 | 702 | 895 | 1,088 | 1,200 
string Tength with 8 4 3 2 96 8 1 

1 9 5 4 3 161 15 2 


respect to base tone 
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The most primitive possibility, “equal temperament” (calculated by the Dutch 
mathematician Simon Stevin in 1585 and also in China around the same 
time), prevailed in the 18th century, showing the various possibilities of ba- 
lance (medium-temperament, uneven-floating temperament). The 12th fifth 
exceeds the 7th octave by 23.46 cents. In order to make a circle from the 
spiral of fifths, the “Pythagorean comma” is distributed uniformly (equidi- 
stantly) over all twelve fifths. The octave is, thus, divided into twelve equal 
half-tone intervals 1: 4/2. In this equilibrium, no interval aside from the 
octave is really pure. However, the differences are so small that the ear can 
tolerate it very well. This eliminates the possibility of tonal characteristics 
(= differences in intonation), (which was the contemporaries’ main objection 
against this new temperament). 


® Remark: The desire to maintain tonal characteristics prevented the “new” tempe- 
red equilibrium from being accepted in musical practice until well into the middle 
of the 19th century. ® 

Since no fifth is pure (i.e. C - G sounds just as good as Cj - Gy or Cb - Gp), 
no interval has to be avoided for performance reasons. A certain “instability” 
that may arise from this can be countered in singing or when playing string 
instruments by spontaneously adjusting the intonation of the respective tonal 
situation in the piece. 

Thus, we have the following equidistantly tempered (= equilibrium) scale, 
where stands for ‘/2: 


tonal note C D E F G A H Cc 
cent values 0 200 | 400 | 500 | 700 | 900 | 1,100 | 1,200 


string length with 0 2 rf ; 7 F 7 
respect to base tone i 1/d 1/r 1A 1/A 1/r 1/d 1/2 


Summary 


The tuning with the Pythagorean numbers 1, 2, 3, and 4 was, indeed, re- 
stricted (consider the third!), but unproblematic. With the addition of the 
number 5 and thus the pure third, problems arise: (a) One needs more than 
12 pitches if one wants to have pure fifths and pure thirds in the system. (b) If 
a certain selection of tone places is determined, one must put up with impure 
thirds and/or fifths. As long as the third note was handled in early multi- 
voice music and considered as a dissonance, tuning problems were of little 
significance. The acknowledgement of the third as a consonance and its use 
in polyphony result in the above-described problems and different approa- 
ches to solving them. In spite of the lost tonal characteristics, the resulting 
equal temperament is a system which offers a satisfactory and understanda- 
ble performance practice for the listener with a relatively small number of 
fixed pitches. 
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A.4 Numerical symbolism 


There is an almost universal view of the “symbolic meaning” of numbers 
that can even be found in philosophical speculation and according to which 
numbers and their relations constitute the essence of reality. Especially in the 
field of the arts, it is common to include meta-levels and go beyond sensory 
perception. 

Some examples of this shall be given here: 

e The fundamental note (see natural tone series) as reference point, i.e. the 

number 1 of the tone series, refers to God, the origin, the unitas. 


e Exceeding the “god-willed” order brings misfortune: The 5 (after the divine 
3 and cosmic all-embracing 4) in the overtone series — the number of 
thirds and sixths — was avoided for a long time. The fifth day of the week 
brings misery. At the 5th trumpet sound, a star will fall from the sky 
.... Similar beliefs can be found in relation to the number 11 (number of 
transgressions, sins, ... ). 


For Pythagoreans, the number is raised to the dimension of a general princi- 
ple in the appearance of beings. The harmony of the world, the cosmos, and 
human order was seen and valued from the validity of numerical relations. 
Therefore, the personal union of the mathematician, astronomer, musician, 
and philosopher was a self-image. The most important number was 10, being 
the sum of the four first numbers 1+-2+3-+4. This is a typical starting point 
for Western thinking and its philosophy. What is fundamental and in ac- 
cordance with the striving for perfection is the metaphysical, ontological, 
and axiological primacy of that which is finite, orderly and harmonious, as 
opposed to chaos, disorder, infinity, and irregularity. The extent of these spe- 
culations is reflected by the fact that the pupils of Pythagoras introduced a 
division into mathematicians (those dealing with quantitative realities) and 
so-called acousmatics (those who consider qualitative aspects, i.e. those areas 
that give emphasis to sound and its cosmic and above-all mental dimensions). 
Some also included a third category in this division — the “esoteric”. 

Especially from the Baroque period onwards, references based on nume- 
rical symbolism, biographical aspects of the composer, text-related cross- 
connections, monograms in the form of numbers, etc. were incorporated in 
musical works. This was accomplished through the selection of the time me- 
ter, number of bars, number of parts, interval structures (second stands for 
2, third stands for 3, ....), number of tones of a melody, and so on. Especially 
in J.S. Bach’s works, we find countless examples of this. The cross-sum of his 
name based on the the numerical alphabet (this was such common practice 
that one could assume it would be generally understood |[I = J, U = V !]) 
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resulted in the number 14, which, as well as the reversal number 41, would 
appear in melodies, measures, and other references in the work. 

The number 29 based on the initials J.S.B., with which Bach signed many of 
his works, has a second meaning: S.D.G. (Soli Deo Gloria = Only to the glory 
of the Lord). The number 158 is also quite frequent: Johann Sebastian Bach. 
When he decided to join Mizler’s society,’ he waited for the membership 
number 14. Obviously, the waistcoat that he wears in his portraits has 14 
buttons. Moreover, the canon (BWV 1076), which he wrote for this occasion, 
references the year 1747 when he joined Mizler’s society. It also cites a bass 
line by G.F. Handel, consists of 11 notes — Handel was the 11th member of 
the society — and the number 14 is constantly present. 


Timing — rhythm 


In addition to tones and intervals, the temporal ordering process is probably 
the most important component of music. Music is regarded as a period of 
time, with the regular recurrence of rhythmic sequences in the continuous 
flow of metrics forming the basis of perception. Sounds are not randomly 
ordered, but instead their temporal sequence exhibits a structure, a certain 
order. The time of music is the pulse. The pulse consists of two-part or 
three-part units of this pulse. The continuous values are produced by simple 
multiples or fractions of the largest unit of measurement. Here as well, with 
this temporal division of the material, we are faced with the principle of 
counting and thus with that of mathematics. 


A.5 Harmonics (basic fundamental research) 


Basics 


This science, based on the integral proportions of the overtone series, is essen- 
tially anthropological. The fundamental idea is that the order that appears 
in the numerical law of overtone series is universal and cosmic as it governs 
both animate and inanimate nature, as well as all human beings. Simple 
proportions have an effect on the mind and the mood of human beings. In 
music, these are the intervals. In the fine arts and architecture, there is, for 
example, the “golden section” or the “Fibonacci series”. 

By means of analogies, proportionality is sought in natural sciences such 
as chemistry, physics, astronomy, biology, medicine, as well as in art and 
architecture. 


3 Lorenz Christoph Mizler — a former Bach scholar — later became a lecturer in mathematics, 
physics, and music. He founded this “Societaét der musikalischen Wissenschaften” (Society of 
Musical Sciences) in 1738. 
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number of intersections | accordance in percentages interval | proportion 
72 | 100 octave 1:2 
42 | 58 fifth 2:3 
28 | 39 fourth 3:4 
24 | 33 major sixth 3:5 
18 | 25 major third 4:5 
14 | 19 minor third 5:6 
10 | 14 minor sixth 5:8 
8 | 11 minor seventh 5:9 
2|3 major second 8:9 
2/3 major seventh 8:15 
0 | 0 minor second 15:16 
0 | 0 tritone 32:45 


TABLE A.2 from R. Haase - “harmonical synthesis” 


Hearing disposition 

The preference of intervals when forming systems from integer proportions is 
fundamental in the nature of man. In the ear itself, in addition to the com- 
bination of sounds (sum and/or difference of two fundamental frequencies), 
“subjective overtones” and “ear overtones” also arise. The series of harmonics 
superimposed in interval formation have more or less common “intersections” 
depending on the proportion. These result in an order on which the qualitati- 
ve division of the intervals into consonances and dissonances has been based 
for centuries (Table A.2). 


Harmonious laws in other areas 


Analogous references to integer proportions of the musical intervals can be 
made in many fields of science. Astronomy: The discovery of harmonious 
laws was already a matter of concern for the ancient philosopher Johannes 
Kepler. He was a harmonist in the best sense of the word and a modern-day 
natural scientist who tried to prove the spherical harmony by calculating the 
planets in their orbits around the Sun, the day arcs, and the angles that 
could be measured from the sun to the extreme points lying on the planet’s 
elliptical orbits within 24 hours. Proportional orders are detected in other 
areas, such as crystallography, chemistry (where proportional orders appear 
in the periodic system of the elements), physics, biology, and, last but not 
least, anthropology, where numerous “musical” proportions can be found in 
the human form. 


Further reading: 

Amon, R.: Lexikon der Harmonielehre, 2nd edtion, Doblinger/Metzler, Wien-Miinchen 2015 

Amon, R.: Lexikon der musikalischen Form, in coop. with Gerold Gruber, Metzler, Vienna 2011 

Amon, R.: Piano Essentials, Eine Anthologie von kurzen Stticken in Theorie und Prazis, 
Doblinger, Vienna 2008 

Gauldin, R.: Harmonic practice in tonal music, New York 1997 

Levine, M.: Das Jazz Theorie Buch, Advanced Music 1996 

Piston, W.: Counterpoint, New York 1947 

Piston, W.: Harmony, Fifth edition, New York 1987 

Ulehla, L.: Contemporary Harmony, 1994 
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A.6 Numerical examples 


>>> Application: Pythagorean scale 

The scale that is created through Pythagorean tuning consists, like our mo- 
dern scale, of seven intervals (five whole tones and two semitones): Take a 
vibrating string that produces the fundamental C. The notes F and G are 
then produced by reducing the string to 3/4 (fourth) and 2/3 (fifth) of the 
total length. When reducing the string to 1/2 (octave), the first tone c of the 
next scale is obtained. Now the remaining tones between C and c, namely 
D, E, A, B can be calculated from the octave and the fifth as a proportion 
(2/3)" : (1/2). For which powers n and m can the smallest possible fun- 
damental tones be obtained? What are the corresponding reductions? What 
gradations result in this scale? 


Solution: 
We have t (5) (5) (5) gmonm 
3) 2 a gn 
We only search for integers m and n such that 
al gn+em 
=< <1 (A.1) 
2 oC 
holds. We solve the problem through testing: 
m. | nm | arte 3” | (A.1) satisfied? | base tone t 
-1] 0 1/2 1 yes c ee 
0]; 0 1 1 yes Cc 1 
Oo; 1 2 3 yes G 273 
0} 2 4 9 no 
1 0 2 1 no 
1 1 4 3 no 
1 | 2 8 9 yes D 8:9 
1 | 3 16 27 yes A 16: 27 
1 4 32 81 no 
2/4 64 81 yes E 64:81 
2 5 128 | 243 yes B 128 : 243 


The gradations are now the quotients from the reductions for the two succes- 
sive notes. This quotient is either 8:9 (“full tone”) or 243 : 256 (“semitone”). 


Semitones are available between E and F (3 : ~ = 33) or B and ¢ respec- 
tively. 
The same applies to the vibrating air columns in wind instruments. <<< 


>>> Application: equally tempered scale 

In the so-called “equally tempered scale”, all tones are produced by dividing 
the string length corresponding to the previous tone by a constant ratio of 
A, once for a semitone, and twice for a whole tone. In addition to the initial 
tone C, the Pythagorean values (Application p. 515) are somewhat corrected. 
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What is the value of \ and how large are the deviations from the Pythagorean 
scale? 

Solution: 

In the scale (Application p. 515), we first have two whole tones, then a half- 
tone, then three whole tones, and finally a last half-tone. The shortening to 
1/2, thus, occurs as follows: 


Be aoe ee eee ae ae : + i? 7, 1272 


Now we create a table: 


base tone Pythagoras | equally tempered | deviation 
Cc 1 1 0 
D 8:9 = 0.88889 1: A? = 0.89090 | —0.00201 
E 64:81 =0,79012 1: A* = 0.79370 | -0.00358 
F 3:4 = 0.75000 1: \° = 0.74915 0.00085 
G 2:3 = 0.66667 1:7 = 0.66742 | —0.00075 
A 16: 27 = 0.59259 1:A9 = 0.59460 | —0.00201 
H 128 : 243 = 0.52675 1: A? = 0.52973 | —0.00298 


The error in the individual tones seems to be quite small, but it is quite 
audible to the trained ear. The new classification has the advantage that 
there are now no inconsistencies in transposing. For the construction of the 
string lengths, see Application p. 516. <<< 


\ 


Fig. A.6 string lengths 


>>> Application: construction of the string lengths of the tempered scale 
For the equally tempered scale (Application p. 515), the powers of \ = \/2 
have to be determined. Show the correctness of the construction of the string 
lengths in Fig. A.6. 

Solution: 

The triangle with side lengths 1, 1, A is isosceles (Fig. A.6). The ratio of the 
shorter side to the longer side is 1: \. Next, we draw the circular arc centered 
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at the left vertex of the triangle and passing through the top vertex. This 
arc intersects the base line at distance A from the left vertex. By drawing the 
parallels to the opposite short side, the long side is also multiplied by A due 
to the intercept theorem, and it thus has the length )?. 

For twelve repetitions, the string should be twice as long. (The twelfth root 
can also be constructed geometrically.) The black strings in the diagram 
correspond to the C major pentatonic scale and the two red strings complete 
it to a C major scale. <<< 


>>> Application: length of the air columns in wind instruments (Fig. A.7) 
We have discussed the “equally tempered scale”. In this scale, all tones are 
produced by shortening the length of the string or the length of the air 
column corresponding to the respective preceding tone in the constant ratio 
\=1: X/2. What is the arrangement of the pipes when a) pipes with the 
same diameter are always used, b) the diameter of the pipes that are used is 
proportional to their length (as is the case with the organ). 


Fig. A.7 The width of the air columns can grow linearly or exponentially. 


Solution: 

First, suppose the pipe diameter d is constant, e.g. d = 1. The longest pipe 
has length L. The next whistle has length \L, then A? L, and so on. The 
pipe of the z-th tone, thus, has the length 


y=A"°L_ (with A\=1: 4/2 ~ 0.9439) 


or 
y=e*L (with c=1/A= V2 1.0595) 


respectively. Fig. A.7, left (tubular chimes) shows an exponential function 
placed over a photograph (similar to a xylophone). It can be seen that the 
pipe length is longer than the functional value by a constant distance: In this 
area, the air cannot vibrate. 

Now suppose the pipe diameter d is not constant, but proportional to the 
length of the vibrating air column (Fig. A.7, right). Comparable points are 
on a straight line: The rectangular cross sections are similar to one another 
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(always with the same similarity factor) and can be mapped into each other 


by dilation. 


‘ull 


| MOA 


Fig. A.8 organ pipes ... 


An organ has hundreds of different pipes. It is by no means necessary to 
arrange these pipes in order of height (even though there is a German saying — 
“standing like organ pipes” meaning to stand in a row from tallest to shortest 
— that seems to suggest so). Some organ builders arrange their pipes in such 
a way as to create interesting patterns, which are more difficult to classify 
mathematially (Fig. A.8, part of the organ in Altnagelberg, Lower Austria, 


during reassembly). <a< 


B Numbers 


This chapter deals with special numbers that have had a significant influence 
on the history of mathematics. First, we will discuss two simple examples 
from the world of “numerology”. One of these is Goethe’s famous “magic 
formula”, with which we can describe a magic square. 

Then, we will discuss the number 7 = 3.14159... and Euler’s number e = 
2.71828... in greater detail. These numbers play an exceptional role in ma- 
thematics, and can be found in many diverse applications. Both numbers are 
transcendental — in other words, they cannot be found as the solutions of 
algebraic equations with integer coefficients. 

We will then turn our attention to the golden mean and to the closely related 
Fibonacci numbers, which are often found in nature — for instance, in the 
structures of flower blossoms. 

Finally, we will discuss complex numbers, which are greatly fascinating despi- 
te their strange definition (for which it is necessary to introduce an “imaginary 
unit” 7 with i? = -1). Complex numbers allow us to solve many problems that 
seem perplexing “in the world of real numbers”, and they explain the funda- 
mental theorem of algebra, according to which every algebraic equation of 
degree n possesses exactly n solutions (if multiple solutions are counted with 
their multiplicity). 

An almost magical relation between e, 7, and i becomes manifest in the 
famous equation 0 = 1+e’”. By performing calculations in the complex plane, 
we can use complex numbers to accomplish elegant solutions to problems in 
kinematics and other technical disciplines. 
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B.1 Numerology 


Since the dawn of human culture, numbers have exerted a magical fascination 
on humans. Many situations prompt us to say: “This can be no accident.” The 
“witch’s one-time-one” by Goethe from his magnum opus “Faust” is almost 
certainly no accident. 


Witch’s one-time-one and magic squares of order 3 


Magic squares have fascinated humans for millennia. These are square matri- 
ces of numbers (n rows, n columns) where the first n? integers are arranged 
such that the sum of each row, column, and both diagonals is equal. This 
sum s is, thus, given: 


2 
eS {2 ie ae 4 
s=— ik, for n= 3: s= yikes a ae 5 45. 
TM gel 3 fal 3 3 


There are many known examples of magic squares of higher order, but only 
one for the order n = 3 (of course, it is still possible to flip and mirror this 
solution in several ways). A magic square of order 4 was found by A. Diirer.* 


>>> Application: How can one memorize a magic square of order 3? 


The answer is found in J. W. von. Goethe’s “Faust” in a section called “witch’s 


one-time-one”:? 


This must thou ken: 
Of one make ten, 
Pass two, and then, 
Make square the three, 
So rich thou'lt be. 
Drop out the four! 
From five and siz, 
Thus says the witch, 
Make seven and eight. 
So all is straight! 
And nine is one, 

And ten is none, 

This is the witch’s one-time-one! 


Faust then ends the verse with a fairly terse comment: 
The hag doth as in fever rave. 


Thttp://mathworld.wolfram.com/DuerersMagicSquare.html 
*http://www.fullbooks.com/Faust-Part-13.html 
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Goethe gave obfuscated advice on how to create a magic square of order 3. 
Several interpretations exist, the following being by the author: 


CN 
2/2 


Fig. B.1 Of one make ten, Fig. B.2 Pass two, and then, 


1. “This must thou ken: Of one make ten.” does not refer to the numbers 


themselves, but to the number of squares! Turn one square into 10, by 
adding 9 more (in the form of a matrix) (Fig. B.1). 


2. “Pass two, and then,” is to be taken literally: The 2 moves from the second 
to the third position (Fig. B.2). 


Z 


Fig. B.3 Make square the three, Fig. B.4 Drop out the four! 


3. “Make square the three, so rich thou'lt be.”: The 3 is inserted (at position 


four, Fig. B.3). With this step, much is already accomplished, as we will 
soon realize. 


4. “Drop out the four!”: Do not insert the 4 and proceed to the next square 
(Fig. B.4). 


21/4] [2] /4/9]2 
3/-/5| [3/-17 31517) /3/517//3]5 


6 8 8} |6//8/1/6]|8}1/6 


Fig. B.5 5,6 > 7, 8 Fig. B.6 All is straight! 


~ 


5. “From five and six, thus says the witch, make seven and eight.”: Now, it 


is the turn of 5 and 6. In their place, we insert 7 and 8 (Fig. B.5). 


6. “So all is straight!”: This is the key sentence: All is truly accomplished. 
We can now supplement the sums towards 15 (Fig. B.6)! 


7. “And nine is one, and ten is none, this is the witch’s one-time-one!” We 


must take this literally: Nine squares are combined into one, omitting 
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419]2 
3|5| 7) RM 
8/116 


Fig. B.7 And nine is one, And ten is none, ... 


square number ten (Fig. B.7). Done! Now, try to repeat the procedure 
from Goethe’s original poem on your own! 


>>» Application: Magic squares of order four (Fig. B.8, Fig. B.9) 
Let us now consider magic squares of order four (total sum 1+...+16 = 136, 
therefore, the sum of each row/column/diagonal is 136/4 = 34). 


Fig. B.9 ancient Indian magic square 


In the year 1514, Albrecht Diirer showed his discovery in an engraving entitled 
Melancholia I. Remarkably, the bottom row of the engraving reads 4 15 14 1 
Not only do the numbers 15 14 appear, but also his initials D. A. > 4 1. 

Another famous magic square of order four can be found in the Parshvanath 
Jain temple in Khajuraho, India. Fig. B.9 not only shows this square, but it 
also provides a “translation table” for the ancient Indian numbers engraved 
in the square. <<< 


>>> Application: ancient temples and tables (Fig. B.10) 

In Fig. B.9, we have an example of a “table” engraved with ancient Indi- 
an numbers. The famous Rosetta Stone, found in 1799, is engraved with 
three versions of the same text translated into ancient Egyptian hieroglyphs, 
Demotic script, and ancient Greek. It provided the key to the modern un- 
derstanding of Egyptian hieroglyphs. The example given in Fig. B.10 on the 
right-hand side shows how numbers were displayed. 

Solution: 

The image shows a list collecting the celebrations donated by Tutmose (Tut- 
mosis) II (18. dynasty, 15th century BC) at the temple of Karnak (rows on 
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Fig. B.10 Luxor: temples and tables with numbers 


the image’s left-hand side) including the sacrifice ratios (small boxes above 
the squares). The numbers in this table can clearly be recognized: A bar 
stands for 1, an upside-down U stands for 10, and a curl stands for 100. 
The donations denoted by these numbers consisted of bread and pastries, 
or the equivalent amount of cereals required for the baking of these goods. 
The unit is one bushel and the necessary amount of bushels are given in the 
corresponding boxes. PPP 


>>> Application: dates in Denmark 

Time and again, we are confronted with simple number games that yield 
astonishing results. The mathematical background underlying these games is 
usually quite simple. We will discuss only one example of such a game here, 
but this example is sure to astonish the reader: 

Think of a number from 1 to 10. Multiply it by 9. From the result of the digit 
sum, subtract 5. The result corresponds to a letter in the alphabet. Think of a 
European country with this initial letter that does not border on Switzerland. 
Now think of a fruit that begins with the same initial letter. The final question 
is: What do dates have to do with Denmark? 


Solution: 

The mathematics behind this is extremely simple: One will realize quickly 
that the number 9n for n = 1, 2,---, 10 always has the digit sum 9. So, if 
you subtract 5, you will get 4 in any case, and the corresponding letter 
in the alphabet is D. In the English language, there is only one European 
country with the intial letter D, namely Denmark. (In German, there are two - 
Germany and Denmark - but Germany is ruled out because it shares a border 
with Switzerland. So, here as well, the only possible answer is Denmark). 
Since there are hardly any fruits with the initial letter D (dragon fruits and 
dewberries are not that common), it is very likely that the person playing 
the game will pick the date... PPP 
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B.2 Rational and irrational numbers 


A rational number is a fraction 2 where the numerator p and the denomi- 
nator q are integers. For q = 1, one obtains integers. Conversely, a number is 
irrational if it cannot be represented as a fraction. 


>>> Application: weekday and date 

The following formula goes back to the young Gauf and is used for the 
determination of the weekday w for a given date (century c, year y (0< y < 
99), day d. The month m is given according to the Julian calender: So, March 
means m = 1, February corresponds to m = 12: 


w = (d+ [2.6-m-0.2] + y+ [y/4] - 2c + [c/4]) mod 7. 


Prove the formula. 


Proof: In contrast to the “Danish date example”, this task is not easy. The orbital 
period of the Earth lasts 365.242198... days, and not 52 weeks or 365 days or 3654 
days (as in the Julian year). In order to account for this irrational length of an Earth 
year, the year must be approximated as closely as possible by means of a fraction. 
In 1582, the approximation of 365.2425 days (Gregorian year) was introduced, and 
on October 4th, 1582, 10 days were skipped as a measure to correct the Julian 
calender. This Gregorian year can now be represented as follows: 


1 1.1 
4 100 400° 


The expression mod 7 (“modulo 7”) refers to the remainder after dividing by 7, ie. 
a number from 0 to 6. The corresponding day of the week is then assigned to each 
modulus (Sunday, Monday, ...). When d is increased by one, w jumps cyclically by 
one. 

The expression [2.6-m — 0.2] means the following: Take the smallest integer equal 
or greater than the expression 2.6-m-0.2. How did Gaufs come to this expression? 
Consider the x-th of March and the x-th of April. Since March has 31 days, the 
corresponding day of the week jumps forward by 3, e.g. from Sunday (=0) to Wed- 
nesday (=3). The x-th of May jumps by 2 days. In this case, the day falls on a Friday 
(=5) because April has 30 days. The x-th of June is a Monday (=8=1) because May 
has 31 days, and so on. Until February, the “number of jumps” amounts to 29, which 
is an average (at 11 jumps) of about 2.6 days per month. Now, you have to retain 
the somewhat irregular alternation between 30 and 31 days a month (historically 
evolved), and Gaufs discovered by trial-and-error that this works with a subtraction 
of 0.2. However, the result of this always exceeds the “number of jumps” by 2, which 
is actually quite favorable, because it will yield the correct weekday (otherwise all 
days would be cyclically shifted). 

The year Y is divided by Gauf? into c hundreds and the remainder y < 100 (for 
example, 2007 is divided into 20 and 0= 7), and Y = 100-c+y. Let us approximate 
the year through 52-7+1+ A - ane + days. If Y years have elapsed, the weekday 


400 
is advanced by the Y - 365.2425-fold value. The expression Y - 52-7 is certainly 


365.2425 = 52-7+14 
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divisible by 7, and is, therefore, not relevant for the weekday. The following number 
is important: 


a 
4 100 400 


So, we are almost done. The number 124-c has the same remainder when divided 


)= 1006+ 25+¢ ho Hee a 
4 4 100 400 


(100-c4 y)-(1 


by 7 as -2-c (because 126 is divisible by 7). The summand —755 + qj does not 


provide an integer contribution. © 

Numerical example: February 16th, 1988 is equated to December 16th, 1987 (d = 16, 
m = 12, c= 19, y = 87) in the Julian calendar. This results in w = 121 mod 7 = 2 and 
is, thus, a Tuesday. 

September, October, November, and December are, as the Latin names suggest, the 
seventh, eighth, ninth, and tenth months. << 


The diagonal of the square 


As we have seen in this book, the strange irrational numbers 7 (the “circle 
number”), e (Euler’s number), and y (the golden ratio) play a fundamental 
role in mathematics. It is not so easy to explain why they are fundamental. It 
has been shown that not only natural numbers and fractions are of essential 
importance. 


>>> Application: a proof with fatal consequences 

A pupil of Pythagoras was able to prove that the ratio of a side a and the 
diagonal d = 2a of a square are not rational. Prove that /2 cannot be 
represented by a fraction. 

Proof: We prove this indirectly by showing how the contrary statement results in 
a contradiction. 

Suppose there is a fraction 


Pasa. 
q 


The fraction is “coprime”, i.e. already “reduced”, so that the greatest common divisor 
of p and q equals 1. Therefore, it is not possible for both p and q to be even (divisible 
by 2), because then one could reduce by 2. 
If we square the equation, then we have 
2 

a =2> py =2 a 

qd 
This means that p? is divisible by 2, and so is p (the square of an odd number is 
also odd). Therefore, g, and thus also q’, is odd. From this we conclude that 2q? is 
divisible by 2, but not by 4. On the other hand, the square of an even number p is 
always divisible by 4, and therefore, we have a contradiction to the assumption. 
In a completely analogous way, one can prove that every square root of a number 
which is not a full square must be irrational. © 


® Remark: This famous proof can be said to be really “classically Greek”: a guided 
verbal series of logical conclusions. Pythagoras was apparently very angry about 
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the arrogance of his pupil, because such irrational numbers seemed to have no place 
in a “rational world-order” — let us remember his scale (Application p. 515), which 
abounds with rational conditions. However, we want to believe that the alleged 
murder of his student was not instigated by him. This proof was first laid down 
by the Greek mathematician and “geometer” Euclid (ca. 325-270 BC). Euclid’s 13- 
volume work The Elements served as a basic textbook of geometry for more than 
2,000 years! ® Prey 


>>> Application: the regular polyhedra and Greek natural philosophy 

A Platonic solid is a regular, convex polyhedron consisting of congruent re- 
gular polygonal faces with the same number of faces meeting at each vertex 
(see also Application p. 231). In Greek natural philosophy, the first four of 
these polyhedra (tetrahedron, cube, octahedron, and icosahedron) stood for 
earth, fire, air, and water. The fifth — the later discovered dodecahedron (Fig. 
B.11) — represented the entire universe (called quintessence = fifth essence). 


Fig. B.11 rolling dodecahedra ... Fig. B.12 non-convex regular polyhedra 


If we do not insist on convexity, there are some other star-shaped polyhedra 
that fulfil the other conditions, such as the Kepler-Poinsot polyhedra, which 
are named after their discoverers and are shown in Fig. B.12: The top row 
shows the great stellated dodecahedron and the great icosahedron. Below, 
there are the great dodecahedron and the small stellated dodecahedron (the 
latter had already been discovered earlier by Paolo Ucello). These solids were 
also interpreted mystically. <<< 
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B.3 Famous irrational numbers 


Some numbers have enormous significance in mathematics, not only from 
a historical perspective. Examples of such numbers are 7, Euler’s number 
e, and the “golden ratio” y. None of these numbers can be represented as 
a fraction. The first two are even transcendental numbers: They cannot be 
found as the solution of an algebraic equation. 


The number 7 


Since antiquity, it has been known that there is a strange number which can 
be used to calculate the circumference of the circle or the circular area: the 
“circle number” 

m= 3.1415926.... 


More or less good approximations to this number date as far back as anti- 
quity. The ancient Egyptians approximated the number by (16/9)? = 3.1605 
and the Babylonians by 3 + t = 3.125. Archimedes proved that the number 
must lie between 34 » 3.1428 and 370 x 3.1408. 


7 
In 1659, John Wallis proved 


ee a ee 
qm 2-4-4-6-6-8-... 
In the year 1736, Euler proved the formula 


12 [o) 
a” 


In 1761, Lambert showed that 7 is not a fraction, but it was not until 1882 
that Lindemann succeeded in presenting the sophisticated proof to demons- 
trate that a could never be the solution of an algebraic equation, i.e. 7 is 
transcendental. 


Euler’s number 

The transcendental number e appears in calculations of interest rates as 
follows: 

If one has 1 monetary unit (MU) in a year (1a) with 100% charge in interest, 
then one obtains 2 MU. At a 50% interest rate per half-year, one receives 
(1 + Ay MU. With Le percent in the period ta. the value is then (1 + 1)" 
at the end of the year. 

For n= 1, 2, 3, 4, 5,... the obtained values are 


0, 005 B37 Ad BAG, 


If you refine this “infinitely accurately” (n > oo), you get the strange limit 
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1 n 
e= lim (14 -) = 2.71828... 
noo n 
The function e” is known to agree with its derivative. It is, with the exception 
of a multiplicative factor, the only function that behaves like this. This can 
be meaningfully interpreted as: The increase in the function e” is as large 
as the corresponding function value at each point, and this has to do with 
natural growth. The offspring of a population is, indeed, proportional to its 
current size. 

Finally, the function e” has the simple Taylor expansion 


which makes the base e, in the age of the computer, an ideal reference number 
for all exponential functions. 


The strange golden ratio 


The two solutions of the quadratic equation 


are the numbers 


_1+v5 1-J/5 


d = 
5 and ag 5 


(Application p. 37). In writing, the two numbers are often denoted by ® and 
vy. They possess a wealth of interesting properties. 


(B.1) 


ay 


>>> Application: a “maximally irrational” number 
Since the two numbers © and y (Formula (B.1)) contain the irrational number 
V5, they are irrational (Application p. 525). Therefore, 


® = a, = 1.6180339887... and y= ag = —0.6180339887... 
have no periodicity, even if you extend the number of decimal places arbitra- 


rily.3 
Show the simple relations 


1 z= = 1 2 = == 
(1) 5 a (2) Ps 
and thus, 


3There are websites on which ®, e, and 7 are listed with an accuracy of several thousands of 
decimal digits. 
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1 1 
(3) @=14+-——, (4). y=-— +. 
a A ee 
1+ T 1+ T 


The two numbers can be represented by “infinite continued fractions”.4 


Proof: Ad (1): 


1 2 &V5=-1) | (5-1) © 
@ V5+1 (V5+1)(V5-1) 5-1 


_V5-1) V5+1-2 V5+1 
2 2 2 


Ad (2): 


1 2  &Vv5-1) (5-1) V5-1_ 
6 V5+1 (V5+1)(V5-1) 5-1 2 


Ad (3) and (4): The number ® is a solution of the equation 


1 © 
® +1 


On the right-hand side, we divide by ® and get the relation 


The left-hand side is inserted into the right-hand side. Thus, we replace $ in the 
right-hand side with es and obtain 
& 


Repeated insertion results in a “continuous fraction”. The equations (3) and (4), 
therefore, follow from (1) and (2). © <<< 


4Tt can be shown that the golden number is irrational in the “maximal” sense so that the worst 
way to approximate it with a given denominator is by means of a rational number. This is due 
to the fact that the elements of the continued fraction (the numbers that appear first in the 
denominator) are all one. 
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B.4 The Fibonacci numbers 


“A rabbit couple gives birth to a young couple every month from the second 
month onward. The young couple, in turn, gives birth to another baby couple 
on a monthly basis from the second month onward. How many rabbits are 
there after n months if we had one young couple at the beginning?” 

This was the question posed by Leonardo da Pisa (ca. 1170 — 1240), also 
known as Fibonacci, who gave a first mathematical model for natural growth. 
Centuries later, Euler gave recursion formulas for these rabbit numbers: 


kn = Kyat + Ke with ky = ko =, (B.2) 


Fig. B.13 flowers with 3, usually 3, and 13 petals 


The sequence is, thus, 1,1, 2,3,5,8,13,21,34,55,.... 

Although this model is very simple and ignores many additional factors, it 
is astonishing how often we encounter Fibonacci numbers, especially in the 
world of plants: The number of petals is very often a Fibonacci number (Fig. 
B.43). 

Roses often have 34 or 55 petals (Fig. B.14). 


Fig. B.14 Only verifiable after dismantling: 55 petals! 


In the case of sunflowers, daisies, and pine cones, the seeds are arranged 
spirally in such a way that when the seeds on each spiral are counted, we will 
always get a Fibonacci number (Fig. B.15). 
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Fig. B.15 Fibonacci numbers in the seed arrangement 


It is easy to prove that the quotient of two successive Fibonacci numbers 
converges to the golden proportion ® (Formula (B.1)): 
kin-1 + Kin kin 1 
q= lim = lin eS i =>q=. 


noo kj, 1 noo kat noo ky 4 q 


In this respect, the golden proportion also holds in botany ... 


A mathematical model 


Phyllotaxis is now understood as a mathematical model for the spiral-shaped 
systems seen in pine trees, sun-flowers, and other plants. 

A plant’s seedbed can grow in the following ways: In order to make room for 
a new seed, the previous seed structure is simply rotated. To avoid collision 
with the first seed structure after a full rotation, the radial distance between 
the two is increased. A smaller magnification gives a more efficient space 
distribution. 

Let us now take the following approach: We set a point Po(d,0) near the 
origin of the coordinate system. Each additional point P, arises from its 
predecessor by being rotated around the origin by a fixed multiple 6 of the 
full angle. At the same time, its distance from the origin is increased by the 
factor (1+ qye, Depending on the efficiency of the algorithm, the constant c 
can be kept as small as possible (in Fig. B.16, c = 0.58). Therefore, P,, has 
the polar coordinates (d(1 + 4)¢ no: 360°). 

The seeds naturally have a certain size, which we want to illustrate with a 
small circular disk. If we test the given rule with a computer, we soon realize 
that the angle of rotation should not be a simple fractional amount of the 
full angle, because then, the seeds are concentrated on radial lines and the 
plenum remains free. The factor 6 should, if possible, not be a fraction: So, 
we take the “maximal irrational number”, the golden proportion 6 = ®. 

We divide the full angle in the golden ratio and get the angle 


y= ®- 360° & 582.4922°. 


For c, a value around 0.6 is found to be meaningful, depending on the seed 
size. Minor changes of ¥ result in entirely different arrangements (Fig. B.16). 
This simulation is very close to reality. 
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~. i 


Fig. B.17 “Real” sunflowers turn with their “back” to the Sun! 


This does not mean that nature has genetically prescribed the abovementio- 
ned algorithm for plants and “knows” the golden proportion to 10 decimal 
places. As already stated in the introduction, mathematics is a self-contained 
microcosm which, however, has the strong ability to reflect and model arbi- 
trary processes. 

The quality of mathematical models depends, on the one hand, on the creati- 
vity of the person who has made it and, on the other hand, on the fine-tuning 
of the model’s individual components. In any case, models should be tested 
on as many examples as possible. 

Furthermore, we should always keep in mind that nature produces inaccurate 
copies. So, it can never be predicted to the last detail. This is the secret of 
evolution, without which there would be no adaptation to change (see also 
Application p. 499). 
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B.5 Imaginary and complex numbers 


In Section 1.5, we discussed the fundamental theorem of algebra (p. 41) 
which states that any algebraic equation of degree n with complex coefficients 
has exactly n complex solutions, if we take into account their multiplicities. 
The complex numbers play a very important role in many areas of graduate 
mathematics (and also in theoretical physics and kinematics). 

What are “complex numbers” really? 

To answer the question, we need the concept of an “imaginary unit” 7. It is 
defined by 


i? =-1. (B.3) 
® Remark: The conclusion 
t=V-1 (B.4) 


drawn from this is to be used with some caution, as one can see from the following 
calculation: With 


@ =4-12V-1V-1=y/(-1)(-1) =v1=1, 


one immediately has a contradiction to the usual arithmetic rules in the real number 
field. ® 


Now, a new set of numbers — the imaginary numbers — is defined. These 
numbers emerge from the real numbers by multiplying each element by 7 
(analogously to the definition of negative integers where the natural numbers 
are multiplied by -1). With real and imaginary numbers, one can solve all 
purely quadratic equations, e.g. 


x =-16=16-(-1) > x=4V16i = +4i. 


By definition, there are as many real numbers as there are imaginary num- 
bers. Now, the real and imaginary numbers can be combined to a new type 
of number: 


z=axutty. (B.5) 


Here, x and y are real, and hence, iy is imaginary. A complex number is 
defined as the sum of a real number and an imaginary number. The values x 
and y are respectively called the “real part” and the “imaginary part” of the 
complex number z. Symbolically, we write z ¢ C if a number z is complex. 
The validity of this new definition can be shown, for example, by solving the 
following quadratic equation with Formula (2.14): 

-24V22-4-1-2. -24V/4i 


g°+27+2=0> 22> ; = (B.6) 
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Conjugating complex numbers 
The solutions z; = -1-7% and zg = -1 +i in (B.6) differ only in the sign of 
the imaginary part. In this case, one speaks of “complex conjugate numbers”. 


Symbolically, one writes zg = Z,. Note that 2 = Z = z,. In general, for 
complex conjugate numbers z, Z, we have 


Z=U+IYyY > Z=L-1y. 


©») 


Yj 
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Fig. B.18 elliptic, hyperbolic, and parabolic pencils of circles 


>>> Application: pencil of circles through complex conjugate points 

The solutions sought in (B.6) are the zeros of the function y = x? + 2x + 2 
(parabola), or more generally, the zeros of all parabolas from the pencil of 
parabolas with the equations y = a(a?+2x7+2) with a € R (Fig. B.18, left). It is 
easy to imagine that all the circles with the equation (#+1)?+(y-q)? = q?-1 
(q€R, q> 1) also have the same zeros. 

No parabolas or circles of these pencils have any real points in common, but 
they intersect at a pair of conjugate imaginary points. One can compute with 
such points in the same way as with real points. 

Fig. B.18 (right) illustrates a hyperbolic pencil of circles (where the circles 
share the zero circles of the elliptic pencil) together with the elliptic pencil. 
The two pencils constitute an orthogonal grid of circles: Each circle of either 
family intersects each circle from the complimentary pencil at right angles. 
Fig. B.19 shows how a hyperbolic pencil of circles can give rise to a hyperbolic 
pencil of spheres in 3-space. <<< 


>>> Application: the absolute circle points 

With the help of complex numbers, one can prove the following curious state- 
ment: All circles in the plane have two points in common. These are complex 
conjugate points on the line “at infinity”. 

Proof: An arbitrary circle (center (p/q), radius r) has the equation 


(a-p)?+(y-q)? =1? = y=q4 Vr? -(a-p)?. 
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Fig. B.19 A hyperbolic pencil of circles gives rise to a hyperbolic pencil of spheres 
by simply revolving it about its axis. 


For us, the quotient 


y_4 V rT \2 Pyy? 

==+ 1 

i ee) 

is of interest. It converges to the absolute values +7 as x + oo because the expressions 
P 


containing those points are defined by the complex conjugate directions y = +i 2. © 


a and + converge to zero independently of p, g, and r. This means that all circles 


® Remark: By means of the absolute circle points, one can explain why two circles 
have a maximum of two real intersections in common, although — as with all other 
curves of degree 2 — a maximum of four can be expected: The two absolute circle 
points are always intersections, albeit complex conjugates and “at infinity”. In an 
elliptic pencil of circles, the remaining two intersections are real points whereas in 
a hyperbolic pencil the common points of all circles are a pair of complex conjugate 
points. If we calculate the distance d of the points (x/+7) from the origin, the value 
d = \/x? + (4ix)? = aV/1-1, i. zero, is obtained. Therefore, the two directions 
y : x = +7 are also called minimal lines. Even the term “at infinity” should be 


handled with care, because the limit value for x > oo is undetermined. ® << 
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Basic arithmetic operations with complex numbers 


Addition and subtraction of complex numbers result “informally” by adding 
or subtracting the real and imaginary parts. However, multiplication uses 
the distributive law and the definition i? = -1 of the imaginary unit. The 
arithmetic mean of two complex conjugate numbers equals 


Z+z _ (a+ty)+(x-iy) _ 


x, 
2 2 


which is the real part of z. The difference between two complex conjugate 
numbers is purely imaginary; the product is real: 


zz=(a+iy)(e-iy)=2° -Py =a? +y’. (B.7) 


With Formula (B.7), you can calculate the reciprocal value of a complex 


number: i = i 
& = 
2 22 ety? = ee 


The division of complex numbers is treated in a similar way: Expand nume- 
rator and denominator by the complex conjugate of the denominator. 


The complex plane 


Obviously, there are substantially more complex numbers than real numbers 
(or imaginary numbers):° You can choose the real part 2 and the imaginary 
part y freely. The best way to imagine the situation is geometrically. If one 
interprets x and y as the coordinates of a point P(a/y) in the plane: 

There are as many complex numbers as there are points in the plane. 

In this regard, one speaks of the complex plane. Conjugate numbers are sym- 
metric with respect to the “real axis” (abscissa). The distance of the point 
(a/y) (which corresponds to the complex number z = x+iy) from the origin 
is given by \/x? + y?, and this is equal to the expression 


|z| = zz. (B.9) 


Analogous to vector calculus, the real number |z| is called the absolute value of 
the complex number z. Complex numbers — interpreted in the complex plane 
— and two-dimensional vectors generally have something in common. The 
sum and difference of two complex numbers can geometrically be interpreted 
as vector addition or vector subtraction. That is to say, one can also perform 
elegant translations with complex numbers in the plane. 

According to Formula (B.8), the reciprocal of a complex number z is the 
reflection of z in the real axis combined with a dilation from the origin with 
the factor 1/(z-Z). 


5In the sense that complex numbers form a two-dimensional continuum. In set-theoretic terms, 
the sets of complex numbers and real numbers contain the same (infinite) number of elements. 
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>>> Application: straight lines in the complex plane 
Show that 
mz+mz=n (meC, neR) (B.10) 


is the general equation of a straight line in the complex plane, where m is 
the normal direction. 
Proof: We set m=pt+ig and z=a+iy. Then, we have 


(p—iq)(a+iy) +(ptig)(x-ty) =n. 


When multiplying, only 
2px+2qy=n 


remains, i.e. a linear equation in x and y. The normal vector of the straight line is 


a multiple of (p,q). © <<< 


>>> Application: circles in the complex plane 
Show that 


zE—-Mz-mz+n=0 (meC, n=mm-r’ eR) (B.11) 


is the general equation of a circle (center m) in the complex plane. 


Proof: An arbitrary circle with center m and radius r can be described by 


je-ml=r=>(z-m)(Z mM) =r? => 2z mz—-mz+mm-r? =0. 
e_—/” 
neR 


2 


For mm =r“, we get the circle through the origin z = 0. Conversely, each curve of 


the type 
22 -™Mz-mzZ+n=0 (B.12) 
is a circle with center m and radius r= /mm-—n. ©) <<< 


>>> Application: “reflection in the circle” (inversion) 

Consider the image w = ! (reflection in the unit circle) and show that it 
“preserves circles”, i.e. circles are transformed into circles. Especially, circles 
through the point z = 0 are transformed into “infinitely large circles”, i.e. 
straight lines. 

Proof: We consider an arbitrary circle given by Formula (B.11) 


zE-™Mz-mz+n=0 (meC, n=mm-r’ eR). 


According to the rules 


we obtain 
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Fig. B.20 reflection in the circle (inversion) 


We multiply by ww, and with 


1-mw-mw+cnww = 0, 


we obtain once again an equation of the same form as shown in Formula (B.12). 
Therefore, the transformed circle is again a circle. The exception is n = 0 (then, 
the circle passes through z = 0): Here the quadratic part vanishes, and according to 
Application p. 537, the result is a straight line with the equation 


mwt+tmw=tl 


perpendicular to the connection of the circle center m and the origin of the coordi- 
nate system. © 


® Remark: This special case is utilized in the theory of wheel gears for so-called 
‘inversors” with which, for example, a technically easy-to-implement rotation can 
be converted into exactly a straight line without the need for a straight directrix. ® 

ddd 


>>> Application: inversion in space (Fig. B.21) 

The inversion can be performed in three-dimensional space equally well. The 
inversion circle is replaced by an inversion sphere (/;1r). The points P and 
P* are, once again, linked via the relation MP -MP* = r?. 
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Fig. B.21 The inversion in a sphere “is sphere preserving”. Here, a torus (as an 
envelope of spheres) is transformed into a so-called Dupin cyclide (also of degree 
four). It inherits a multitude of beautiful properties from the torus. 


The transformation P+ P* is quadratic: Planes are, thus, mapped to sphe- 
res (in general). One can prove this by rotating the planar situated straight 
line (> circle) about the normal of the straight line through the center M. 
Surfaces of n-th order are in general transformed into surfaces of 2n-th or- 
der. Planes are also considered as spheres with “infinitely large radius” and 
“center at infinity”. Spheres and planes are transformed into spheres or pla- 
nes, i.e., “inversions preserve spheres”). The example in Fig. B.21 shows that 
surfaces created by spheres are transformed into surfaces of the same order. 
By analogy to circle reflection, we may, at this point, speak of reflection in a 
sphere. daa 


The following example and also Application p. 543 show that under certain 
circumstances, sophisticated formulas can be represented in a compact form 
by means of complex numbers. Using appropriate software, computers can 
calculate with complex numbers as well as real numbers. The effort for the 
user of such software is, thereby, enormously reduced. 


>>> Application: equipotential surface 
If z1 and zg are two arbitrary complex numbers, then they can be used to 
define with the equation 


ZR 


w(z) =| | (wéR) 


Z- 29 


the surface depicted in Fig. B.22 (the surface has been trimmed close to 
the poles). Show that for 21 = -1+% and zg = -1-i above the real axis, a 
horizontal line lies on the surface. 
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Fig. B.22 potential surface with circular contour lines 


Solution: 
The real axis is described by z = t (t € R). Thus, 


=a). (f= (- ly eis 
I-l=Ga—a |" leraal 


If we let u=t+1, we further have 


w(t) = [M4] =| Od) 


uti! W(ut+it)(u-2) 
— lu? -1-2ui| _ J(u? = 1)? + 4u? Pei? 4 
ns ee u2+1 ee 


Above the real axis, the surface points have a constant height of 1 so that 
the corresponding curve on the surface is actually a straight line. 


® Remark: It is a little bit more tedious to show that the horizontal curves on the 
surface are mapped to a hyperbolic pencil of circles in the top view (Fig. B.22, left). 
These correspond exactly to the hyperbolic pencil shown in (Fig. B.18, left). The 
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corresponding elliptic pencil (Fig. B.18, left) is the image of the lines of greatest slope 
on the surface. In physics, w can be interpreted as the potential of two electric fields. 
The horizontal curves are, then, called the equipotential curves, and the curves of 


greatest slope are called field lines. ® <<< 


The function ec? with complex numbers 


If we extend the real function e” to complex numbers by means of its power 
series expansion in Formula (8.5), then we obtain 


alters tat ate (zéC). (B.13) 


>>> Application: Euler’s formula 
The following famous relation between real trigonometric functions and the 
exponential function is due to Euler. Show that 


e'? =cosytisiny (B.14) 
or, equivalently, 
er re? ere? 
cosy = 5, sin y = — (yeR). 
i 


Proof: According to our formulas developed for real arguments, we set y ¢ R 
(Application p. 382 ff.) and get 


Pale ss Wat 
ingre BoE Bn 
coups GG 
The extension to complex numbers holds for pure imaginary numbers iy (y € R): 
eb’ =1+ip+ ee _ ae +... =1+iy & if 7 Pagee 


The above assertions can now be verified directly with the use of series expansion. © 
ddd 


Geometric interpretation of the multiplication 

The relation between the complex function e* and the real functions sin y 
and cosy leads to a very elegant means of calculation in the complex plane. 
For this, we first change from Cartesian coordinates to polar coordinates. It 


is obvious that 
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=/zr2+y?, tang ==. (B.15) 
The real and imaginary part can be computed from polar coordinates by 
z=r(cosy+i sing). (B.16) 
Together with Formula (B.14), we have the elegant representation 
zere’’. (B17) 


For r = 1 and y = 7/2, we get the unit i; hence, for r = 1 and y = 7, we get 
Euler’s identity: 


eit/2 = i, et SI. (B.18) 


If we apply the calculation rules for power functions to the multiplication of 
two complex numbers 21 and z2, we get 


Zp 2g = rp ere «rg M2 = Pyro elhP1t 2), (B.19) 


We see that the argument (polar angle yi + y2) of the product 2 - z2 of the 
two complex numbers z, and zz is the sum of the arguments (polar angles 
1 and 2) of the two complex numbers. 

Geometrically, multiplication by a complex number z = re” can be interpre- 
ted as the combination of a rotation (roation angle y) with a dilation (scaling 
factor r). In fact, multiplication by the number z = e’? is a pure rotation. 
This is the reason why complex numbers are extensively used in planar ki- 
nematics, in which, besides translations, rotations play an important role. 


2,1,1) (4,3,2, (4,-3,-2,1) 2,3,4) 2,3,4,-1) 
3,4) 1,-2,-3,4) (2,3,4,1) (2,3,-4,1,-4) 


MOeoPds 


Fig. B.23 trochoids of order 4: upper values: rz, lower values: wp 


>>> Application: trochoids (Application p. 300) 

A trochoid is generated by rotating a first rod (length r,) at a constant 
angular velocity w; while a second rod (length rz) rotates about the end point 
of the first at the proportional angular velocity wz. What is the equation of 
such a curve in the complex plane? 
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Solution: 
Assume that t is the elapsed time. Then, the center of the second rotation has 
the position z, =r; e“!*. The end point of the second rod has the position 


t dw t 


z=m tree" =rye ee 


+79: e 
The superposition of rotations can easily be generalized, and this will result 
in so-called “trochoids of order n”, which can be represented in the complex 
plane as 

n . 

Z= > rpervet, 

k=1 
It can be shown that any planar curve can be approximated in parts to an 
arbitrary precision by a trochoid of sufficiently high order. <<< 


Powers and roots of complex numbers 


The representation of complex numbers by Formula (B.17) also allows us to 
compute powers and roots of complex numbers: If one multiplies a number z 
by itself, the radius must be squared and the polar angle must be doubled. If 
one takes the number to the power n (i.e. multiplying it n times with itself), 
one has 

z” =(r(cosy +isiny))” =r"(cosny +isinny). 


The exponent n may also take non-integer values, and the calculation rules 
for powers (and for roots) can be sensibly carried over to complex numbers. 
The coefficients of a quadratic equation, for example, can also be complex. 
Then, the complex solutions following Formula (2.14) are generally no longer 
complex conjugate. 


>>> Application: “Fermat surfaces” 

A famous theorem of Fermat, which has only recently been proved in a 
sophisticated way, states that the expression a” +b” = c” has positive integer 
solutions (a,b,c) only for n < 2 (Application p. 121). Thus, for n > 2, we can 
safely divide by c” (without omitting integer solutions) and obtain w” + 2” = 
1=>w(z) = V1- 2” (with w =a/c and z = b/c, ignoring the sign). If we also 
allow complex numbers z, then of course, the result w is also complex. By 
forming the absolute value, we make the result real (as we did before, in the 
case of the equipotential surface) and consider the surface 


w(z) =|V1-2”| 


over the complex plane. Fig. B.24 shows two extreme views of the surface for 
n= 5 (for a “common view” see p. 519). Calculate its zeros. 

Solution: 

The equation w = |W1-z°| = 0 is to be solved. From |w| = 0, it follows 
unambiguously that w =0+02=0, so that we obtain VY1-2=051-2= 
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Fig. B.24 aesthetics in mathematics: extreme perspectives of the function graph 


0 => z° = 1. Transformations of equations in complex numbers have to be 
done very carefully, especially when absolute values are involved. 

In polar coordinates, we have 1 = (1; 27k) with an arbitrary integer k. The 
fifth roots are, thus, (\/1; a k) with k=1,...,5, and they form the vertices 
of a regular pentagon (Fig. B.24) with the coordinates (1/0), (cos 72°/ + 
sin 72°), (cos 144°/ + sin 144°). 444 


>>> Application: formulas for sinz and cosz for complex numbers 
Let us define the complex sine function by allowing complex arguments in 
Formula (B.14): 


sin z = - 
21 
With z=ax+iy (2,y€R), we haveiz=-y+iz, -iz=y-iz, and then, 
e* =e Y'* =e Ye'* =e Y(cosrt+isinz), 
@ "=e" = ee ** =e (cosx —isin x). 
Therefore, we have 


e"(cosz + ising) —e¥(cosa—isina) _ 


sin z = 


2i 
1 6 ¥—eY e¥+ey . 
= > ———- cosx + ——— sing, 
a 2 2 
Ww 


1 


and with Formula (6.27), the relation 
sin z = sinxcoshy+icoszsinhy. (B.20) 


Analogously, for the complex cosine function, we have 


e ¥(cosx+isinzg)+e¥(cosx-isinz) e4%+e¥ ll Ee 
COs z = 7 = 3 cosa - i 5 sin 
=> cos z = cosacoshy — isin z sinh y. (B.21) 


B.5 Imaginary and complex numbers 545 


Conformal mappings 

A conformal mapping in the plane is an angle preserving transformation of 
a domain to another domain. Under this mapping, orientations of planar 
figures and the angle of intersection of two arbitrary curves are also preser- 


ved. Such mappings occur, for example, in electrical engineering, geodesy, 
hydrodynamics, and aerodynamics. 


Fig. B.25 conformal mapping w = z + 1/z, applied to a circle 


>>> Application: flow profiles 

In flow theory, the conformal mapping w = z + 1/z is important. Fig. B.25 
shows how a circle (center z = 0.5 + 0.357, radius 2) is “transformed” by 
this mapping into a wing profile. Flows around circular profiles are easier to 
handle and the results can be carried over to wing profiles. <<< 


>>> Application: conformal patterns 

Describe the images of xz-parallel and y-parallel lines as well as the images of 
arbitrary straight lines under the conformal mapping w = e*. Show further 
that this mapping is angle preserving. 

Solution: 

We have w = e?* = e”-e¥. For constant y (x-parallel), w is a complex number 
with radius e” and a fixed polar angle y. Thus, all these complex numbers 
correspond to points on straight lines through w = 0. The distance from the 
origin increases exponentially with x. For constant x (y-parallels), all these 
complex numbers correspond to points with constant (equal) absolute value 
and varying polar angle y. 

For a generic straight line y = kx + d (k, d€ R), the mapping results in 


L eilka+d) - eid et. ike 


w=e 
With variable x, the number e’*” travels along the unit circle with the velocity 
kx proportional to x. By multiplying by the real value e”, the radial distance 
x is scaled exponentially. A rotation through the angle 1 corresponds to a 
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Fig. B.26 flower-like patterns using w = e* 


scaling with the factor 1/k. Therefore, the complex number (point in the 
complex plane) moves along a logarithmic spiral with the constant course 
angle » = arctank (Application p. 327), which corresponds to the slope of 
the initial line. Multiplication by the constant value e’? causes an additional 
rotation of the spiral through the constant angle d. Thus, parallel lines are 
mapped to a family of congruent logarithmic spirals. The fact that the slope of 
the lines and the course angles of the spirals agree confirms that this mapping 
is conformal. In fact, the angle of intersection of two curves is measured 
between the tangents at the common point. These are two arbitrary straight 
lines (see Application p. 264). 


Fig. B.27 conformal mappings of groups of sinusoidal curves (Franz Gruber) 


Fig. B.26 on the left illustrates this for k = +1, i.e. straight lines inclined at 
+45°. The spirals intersect each other like the straight lines at right angles. 
In order to obtain the image in Fig. B.26 (center) or the one on the right, 
congruent sinusoids whose central lines are parallel to the x-axis are subjec- 
ted to the mapping. In this way, you can easily obtain pretty patterns that 
are reminiscent of flowers and are difficult to describe without conformal 
transformations. 

In Fig. B.27 on the left, the complex sine function was applied to a family 
of sinusoidal curves. The middle axes of the sinusoid is xz-parallel. The two 
other figures show images of the sinusoids after applying the complex tangent 
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function w = tan z = sinz/ cos z. Read more on the website that accompanies 
this book. << 


>>> Application: a practical nautical chart 

In the 16th century, Kremer (lat. Mercator) designed a nautical chart on 
which straight lines should give the best lines of navigation between the 
continents. If P(A, y) are the geographic coordinates (length and width) of 
a point P on the globe, then the point P° in the map has the coordinates 
(A, Ing) (Fig. B.28). 


Fig. B.28 Mercator projection = stereographic projection + conformal mapping 


The people did not trust the matter until, a century later, Kauffmann (lat. 
also Mercator) verified the case. Not surprisingly, the projection is called 
“Mercator projection”. With today’s knowledge, the matter can quickly be 
explained: The points of the Earth are stereographically projected (Applica- 
tion p. 331) from the North Pole to the tangential plane in the South Pole 
(P+ P*). The meridian / and the circle of latitude k are mapped to a straight 
line /* and the circle k*. The mapping is conformal and circle-preserving. 

The pattern that we have obtained consists of radial lines and concentric 
circles. We achieved this by mapping a square grid with the conformal map- 
ping w = e* (Application p. 545). If we apply to the above-mentioned pattern 
the inverse of the exponential function, that is, the equally conformal (angle- 
preserving) complex natural logarithm w = In z, then we can expect to obtain 
a rectangular grid: This results in points P°, straight meridian circles /°, and 
straight circles of latitudes b°. The shortest routes for shipping would be great 
circles k. Unfortunately, these are mapped — viewed globally — to sinusoidal 
waved curves k°. At that time and in the ideal case, a captain was steering 
a course with constant angle (against the meridians), and therefore, not too 
far from a great circle on a loxodrome on the sphere (Application p. 330). 
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This loxodrome is mapped to a straight line on the nautical chart because of 
the angle preserving property. <<< 


>>> Application: minimal curves 
Assume that R= constant; R,T,X,Y,Z ¢C and show that the complex helix 


X = RsinT, Y = ReosT, Z=iRT (B.22) 


has arc length zero, and thus, deserves the name minimal curve. With real 
numbers, this is impossible. 

Proof : We have to show that the distance between two neighboring points is always 
zero, that is, the “arc element” vanishes: V X2+ Y?+Z? = 0. However, with X = 
ReosT, Y =-RsinT (> X?+Y? = R?), Z=iR (= Z? = -R?), this is actually 
always fulfilled. © «<< 


>>> Application: bending of minimal surfaces 

Weierstrass proved that the real part of a minimal curve is a minimal surface 
(see Application p. 433). Calculate the real part of the complex helix from 
Formula (B.22) for R=1 and R =i. 


Fig. B.29 bending the catenoid (gray) into the helicoid (yellow) 


Solution: 

In this example, for the sake of simplicity, real numbers are written in small 
characters and complex numbers (except the imaginary unit 7) in capitals. 
For R= 1, Formula (B.22) gives X = sinT, Y =cosT, Z =iT. 

We set T =u+iv. Then, according to Formula (B.20) and Formula (B.21), 
the real parts thereof equal 
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x =sinucoshv, y=cosu coshu, z= -v. (B.23) 


If u and v are interpreted as surface parameters, then we can see in these 
equations the parameter representation of a catenoid (see Application p. 433 
or Fig. B.29 on the left). 

For R =i, Formula (B.22) gives X =isinT, Y =icosT, Z=-T. 

The real parts thereof are x = —cosusinhv, y=sinu sinhv, z = -u or, with 
the substitution r = sinhv 


r=-reoosu, y=rsinu, z=—wu. (B.24) 


With the surface parameters r and u, this is a parameter representation of a 
helicoid (see Application p. 309 and Fig. B.29, right). 


Fig. B.30 “dance of minimal helical surfaces” 


For the remaining complex unit numbers R = e’?, there are surfaces which 
are “intermediate stages” between the catenoid and the helicoid (again, a 
parameter representation is available). They are all minimal surfaces. Their 
calculation and representation is left to the computer. On the website that 
accompanies this book you will find a program that shows you the con- 
tinuous bending from any desired view (minimalflaechen.exe). Since the 
program can calculate with complex numbers as well as with real numbers, 
the programming effort was very low. 
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If the minimal helix from Application p. 548 is replaced by another minimal 
curve, a variety of minimal surfaces is obtained which can be continuously 
bent into one another. The surfaces in Fig. B.30 are found when spiral lines 
are selected instead of helices. 


® Remark: The last examples should have made clear in what an incredibly efficient 
manner certain facts can be described using complex numbers. “From a higher point 
of view”, or “in a higher dimension”, complicated connections sometimes appear 
quite obvious. Likewise, when we move from the plane into space, we may see many 
similarities of conic sections in a different light. @ <<< 


Fig. B.31 Two fairly similar minimal surfaces: The one on the left is generated by 
formulas, the one on the right by “biological necessities”. 


>>» Application: minimal surfaces in nature (Fig. B.31) 

If we look at the computer generated surface (it is, by the way, the famous 
Enneper surface, a rational minimal surface of degree nine), and compare it 
with the brown algae to the right, the similarity is stunning. Why do surfaces 
in nature tend to be minimal? 


Solution: 

A minimal surface can be bent into an infinite series of associated minimal 
surfaces, whereby the metric on the surface does not change. Points on the 
surface (skin, etc.), thus, have a constant distance (measured by means of 
“geodesic curves”, i.e. shortest curves between the points). Even angles do not 
change. Therefore, a right triangle, for instance, would remain a right triangle. 
For animals or plants with surfaces that come close to minimal surfaces, this 
allows fast changes of shape without major impact to the surface. <<a 


>>> Application: Mandelbrot sets (Fig. B.32, Fig. B.33) 

Since Benott Mandelbrot published his classic “Fractal Geometry of Nature” 
in 1977, fractals have become very popular. They play an important role in 
science. Let us consider the complex function f(z) = z? +c. Then, we define 
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a sequence of complex numbers 20, 21, 22,.-. as follows: zo = 0,21 = f(zo) 

Cage eni= Filey) = 2 +c. If the sequence z,, remains bounded, then the 
number c is considered to be “good”, and we mark it with a black point in the 
Gaussian plane. The result is a strange and very popular set, named after 
Benoit Mandelbrot. How is it possible to determine whether z,, is bounded? 


4 ng) 
2 “a 
; be ew © : 
Cos 


Fig. B.32 Mandelbrot set: We start with a complex value c (marked in red). Then, 


we apply the function f(z) = z? +c several times (green). If the results stay inside a 
circle of radius 2 for, say, 100 times (left-hand side), we mark c in black. If they do 
not (right-hand side), we mark the point in another color, depending on the number 
of iterations needed to determine that z,, does not remain bounded. 


Solution: 

Without proof, z, will “blow up” if its radius of convergence is greater than 2. 
We let the computer do the work: Apply f(z) until this happens. If it does 
not happen for a reasonable number of tries (e.g., 100), we consider c as 
forming part of the set. 


Fig. B.33 The Mandelbrot set is a classical fractal: You can zoom in as much as 
you want. The resulting borderlines will not change. 


Yet, in very few cases, this might be too inaccurate and we have to consider 
that the set is so extremely complicated that it is simply not worth checking 
further: Even if we zoom into the set again and again, no distinct borderline 
will appear. 
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Fig. B.34 Marble fractals: These photos look as if they were computer-generated. 


Strictly mathematically, a fractal is a set that exhibits a repeating pattern 
displayed at every scale. In nature, this will not happen with the same accura- 
cy. Nevertheless, when cutting stone, especially metamorphic rock composed 
of recrystallized carbonate minerals (marble), the cross-sections look quite 
similar to mathematically generated fractals (Fig. B.34). <<< 
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roof inclination, 377 
optimization problem, 374 
orbit length, 318 
orbital period, 92, 159 
organ, 88, 509 
Orion belt, 232 
orthogonal 
grid, 144 
matrix, 313 
projection, 304 
orthogonality, 198 
oryx antelope, 429 
oscillate, 34 
oscillation duration, 384 
oscilloscope, 355 
osculating 
circle, 338, 340, 348 
parabola, 387 
plane, 338, 339 
ostrich, 81, 257 
Otto-engine, 276, 277 
overbooking, 489 
overtone series, 502, 513 
oxygen, 35, 77, 83, 238 
supply, 84 
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P perpendicular bisector, 348 function, 244, 542 
perpetuum mobile, 21 series, 381, 417, 541 
palm, 127 Perseids, 337 powers of ten, 11 
Palophus titan, 83 perspective, 385 precision, 8 
Pangea, 51 petal, 530 prevalence, 464 
parabola, 33, 242, 268, 333, phase change, 94 primary 
434, 534 photo plane, 54 law, 504 
as envelope, 342 photograph, 161, 304 rainbow, 167 
cubic, 33 photography machine, 303 prime number, 452, 502 


of degree 6, 33 
osculating, 387 
unit, 242 
parabolic 
cylinder, 387 
flight, 61 
point, 388 
paraboloid, 387 
elliptic, 387 
hyperbolic, 387 
parachute, 70 
parallel, 27 
angles, 145 
displacement, 389 
projection, 334 
parallelogram, 177 
of forces, 177 
parameter, 307 
parameter representation, 184, 
315, 325, 361 
in polar coordinates, 326 
of a catenoid, 549 
of a circle, 315 
of a circle in space, 319 
of a clothoid, 418 
of a curve, 325 
of a helical motion, 307 
of a helicoid, 549 
of a line in space, 334 
of a meridian, 369 
of a space curve, 325 
of a straight line, 184 
parametrization, 315, 360 
parasympathetic system, 498 
partial 
reflection, 165 
resistance, 19, 27 
Pascal, Blaise, 456 
passenger aircraft, 113, 171 
path tangent, 301, 318 
path-time 
diagram, 140 
function, 261 
paw print, 86 
pectoral fin, 217 
pedal point, 199 
pencil of 
circles, 534, 540, 541 
conics, 144 
parabolas, 534 
rays, 57 
pendulum, 20, 384, 420 
penguin, 113 
pension calculation, 497 
pentagon, 544 
dodecahedron, 205 
pentaprism, 215 
pentatonic, 505 
penumbra aura, 126 
pericarp layer, 321 
perihelion, 159, 285 
period 
of oscillation, 20, 420 
orbital, 92, 159 
sidereal, 93 
periodic system, 514 
permutation, 472 


phyllotaxis, 531 
physics, 8 
phytoplankton, 107 
piano keyboard, 506 
picture element, 74 
pie chart, 439, 440 
pine 
cone, 530 
tree, 531 
pipe surface, 351 
piston speed, 276 
pitch circle, 318 
pitched roof, 191 
pixel, 42, 72 
planar 
polygon, 68 
plane, 177, 489 
normal, 229 
of symmetry, 192, 229 
radical, 148 
sensor, 19 
planet, 82 
planetary 
gear, 44, 115 
motion, 300, 301 
Plato’s school, 45 
Platonic solids, 231 
Pleiades, 232 
pliers, 108 
Pluto, 158 
pocket calculator, 381 
poikilothermal, 257 
point 
at infinity, 30 
charge, 493 
distribution, 492 
of contact, 348 
of inflection, 338 
principal, 385 
random, 492 
sampling, 438 
target, 304 
points in a decathlon, 244 
Poisson distribution, 485 
Poisson, Siméon Denis, 485 
polar 
angle, 131, 177, 274, 326 
coordinates, 326, 331, 
332, 334, 541 
developable, 340 
night, 324 
Pole Star, 304, 364 
polygon, 68 
polyhedron, 205 
population 
growth, 254 
on the Earth, 429 
profile, 439 
statistics, 259, 287 
Portuguese galley, 364 
position 
Mars closest to Earth, 


159 
vector, 176, 184, 302 
positioning, 151, 363 
potential energy, 181, 401, 420 
power 


primordial soup, 110 
principal 
determinant, 58 
diagonal, 29 
normal, 338 
point, 385 
ray, 16 
principle of appearance, 512 
prism oblique, 404 
probability, 453 
distribution, 445, 482 
theory, 392, 417 
product 
event, 451 
matrix, 313 
rule, 267 
professional camera, 110 
profit 
function, 23 
outlook, 449 
projection 
central, 304 
cylindrical, 128 
normal, 260 
orthogonal, 304 
parallel, 334 
rectangular, 52 
stereographic, 331 
pronation, 179 
Propalaeotherium, 112 
proportionally, 69 
proportions, 15, 67 
protozoa, 99 
pseudo-random number, 491 
pseudopupil, 215 
Pteranodon, 82 
Ptolemy, 502 
pulleys, 181 
pulse, 513 
pygmy shrew, 85 
pyramid 
oblique, 404 
of Cheops, 122 
of Chephren, 74, 121, 221 
ramp, 136 
truncated, 136, 186, 187 
pyramidion, 136 
pyramids of Giza, 74 
Pythagoras, 204, 396, 502, 503, 
525 


Pythagorean 
comma, 507, 510 
number, 511 
ordering, 503 
scale, 515 
school, 121, 502, 512 
triples, 121 


tuning, 507 

quadratic 
equation, 23, 36, 37, 543 
facet, 215 


quadrifolium, 43 

quality control, 489 
quantum mechanics, 499 
queen of the functions, 265 
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queue, 475 
quintessence, 526 
quotient rule, 268, 269 


R 


Reaumur, 24 

rabbit, 530 

race course, 9 

radial line, 531 

radian, 119, 125, 141, 142, 157, 
315 


radical 
line, 317 
plane, 148 
radio telescope, 343 
radius of 
curvature, 348 


the Earth, 91, 92 
rafter’s length, 147 
rail shot, 210 
rainbow, 166 
rainwater, 377 
random 

experiment, 453 

number, 491 

number generator, 491 

point, 492 
raptor, 99 
rational number, 524 
reaction time, 10 


real 

axis, 536 

function, 238 

line, 238 

number, 8, 533 

part, 533, 548 
reciprocal, 536 
recompression, 77 
reconstruction of date and ti- 

me, 

rectangular 

map, 321, 322 

projection, 52 

strip, 358 

tripod, 204 
rectifying 

developable, 358, 359 

plane, 338, 339 
recursion formula, 470, 530 
red sunset, 168 
reducible equation, 42 
reduction, 515 
reflecting cuboid corner, 210 
reflection, 152 
in a circle, 151, 537, 539 
in a plane, 207 
in a point, 185 
of a light ray, 208 
reflector of a lamp, 199 
refraction, 16 
at a plane, 41 
at a spherical surface, 165 
refractive constant, 299 
regular 
n-gon, 141 
surface, 350 


tive 
body strength, 85 
frequency, 453 
remaining life expectancy, 496 
representing fractions, 15 
reptile, 257 
residual mass, 51 


rela 


retirement 
benefits, 497 
system, 440 


return over long time, 248 


reversed, 209 
revolution period, 93 
revolutions per second, 96 
Reynolds number, 116, 118 
rhino, 87 
rhythm, 513 
right 
curve, 371 
strophoid, 44 
triangle, 119 
rigid body motion, 300 
Rio de Janeiro, 225 
road construction, 418 
rock erosion, 47 
rocket, 403 
roll over, 252 
rolling 
a circle, 301 
dice, 451, 487 
root, 23, 41 
rope force, 60 
rose, 530 
Rosetta Stone, 522 
rotating 
a graph, 410 
a straight line, 200 
rotation, 542 
about a line, 228 
about arbitrary axes, 305 
about the origin, 298, 302 
about vertical axes, 301 
generic, 195 
matrix, 297, 298, 302 
of the firmament, 304 
of the Moon, 421 
rotoid, 325 
rotoidal 
helicoid, 350 
motion, 305 
row vector, 312 
rule, 237 
runner, 9 


S 


saddle point, 371 
safety stop, 419 
Saint Petersburg paradox, 476 
salmon, 257 
sample, 442 
sampling point, 438 
sarcophagus, 97 
satellite, 13, 316 
receiver, 343 
Sauveur, Joseph, 502 
scale, 505 
scalene triangle, 145 
scaling, 181 
a vector, 181 
function, 385 
Schlick, Arnold, 509 
scorpion, 81 
screwdriver, 115 
screwing a sphere, 353 
sea 


ice, 83 
level, 25, 47 
seal, 89, 164 
secant, 348 
second, 505 
secondary rainbow, 167 
sensor 
sensitivity, 70, 71 
sensor plane, 19 
sequence, 530 
series 
expansion, 381 
of points, 213 


sexual proportion, 493 
shadow, 126 
of a point, 193 
shark, 65, 77, 217, 447 
attack, 447 
design, 118 
great white, 81, 447 
oceanic whitetip, 217 
reef, 77 
shipping, 211 
shoaling fish, 65 
shooting star, 337 
shortest flight route, 322, 361 
shot put, 244 
shrew, 445 
shutter speed, 70 
Siberia, 48 
Siberian tiger, 88 
Sicily, 324 
sidereal period of rotation, 93 
similar 
humans, 106 
objects, 67 
triangles, 15 
similarity, 68 
simple pulley, 181 
Simpson formula, 134, 416 
simulation, 531 
sine, 131, 271, 382, 386 
single child, 495 
sinusoid, 322, 355, 360, 410 
Sirius, 233 
slope, 260 
SLR, 215 
small stellated dodecahedron, 
526 
smuggler, 461 
snail 
king, 330 
shell, 350 
Snel, van Royen Willebrord, 


378 
Snell’s law, 164 
soil bacteria, 442 
solar 
eclipse, 93, 156, 160, 423 
power plant, 227 
Soli Deo Gloria, 513 
solstice, 226 
solution, 23 
multiple, 41 
sound 
consolidation, 503 
system, 505, 507 
source code, 491 
South Atlantic, 83 
southern 
Cross, 226, 364 
hemisphere, 170, 305 
space 
capsule, 403 
Euclidean, 175 
Gaussian, 56, 166 
shuttle, 403 
spacecraft, 40 
spectral color, 166, 168, 170 
speed of light, 13 
speed-time function, 261 
speeding up a process, 19 
sphere, 415 
Monge’s technique, 322 
spherical, 16 
cap, 105, 282, 411, 415 
coordinates, 222, 336 
rectangle, 215 
segment, 352, 409 
triangle, 128 


Index 


561 


trigonometry, 323 
spindle torus, 126 
spiral 
staircase, 385 
surface, 351 
spiral of fifths, 511 
spiral-shaped system, 531 
spirit level, 97 
spline curve, 290, 438 
spotlight, 227 
spring force, 307 
square, 69 
squinting, 209 
stability of a ship, 346 
stag beetle, 56, 108 
staircase function, 288 
standard deviation, 445 
star, 109 
star-shaped polyhedra, 526 
starry sky, 126 
statistics, 259, 437 
steering wheel, 418 
Steiner, Jakob, 174 
step function, 495 
stereo image, 208, 209, 289 
stereographic projection, 331 
stereoscopic, 209 
sternum, 187 
Stevin, Simon, 511 
Stone Age 
painting, 232 
people, 48 
Stonehenge, 221 
strained rope, 207 
strange 
golden ratio, 528 
thread, 69 
stratosphere dive, 58 
street map, 8 
string, 3, 501, 502 
strophoid, 44 
Strouhal-number, 293, 294 
sub-determinant, 30 


submersion of a 
balloon, 245 
sphere, 283 

subnormal, 343 

substitution, 28, 42, 431 
method, 30, 34 

subtraction, 8, 14, 536 


suicide, 486 
sum 
event, 451 


of angles, 145, 150 
summation rule, 266 
summer, 227 

month, 227 
Sun, 94, 126, 156, 304, 320, 

327, 336 

gear, 114 

irradiation, 226 

position, 224 

ray, 126, 208 
sunburn, 72 
sundew, 86 
sunflower, 2 
sunrise, 226 
sunset, 221, 225, 226, 422 
superellipse, 327 
superhyperbola, 327 
superposition of rotations, 543 
supersonic speed, 58 
supplementary 

angle, 199 

arc, 150 
supposed similarities, 78 


surface, 297, 350 

area, 67 

element, 407 

of a spher. segment, 408 

of a sphere, 408 

of a torus, 408 

of revolution, 322 

of the Earth, 107 

point, 387, 540 

sphere, 409 

tension, 111 
swarm rules, 65 
sweepstake, 457, 472 
Switzerland, 523 
Syene, 122, 125 
symmetry plane, 338 
sympathetic system, 498 
synchronization time, 57 
synchronously twisting, 179 
synodic orbital period, 93 
syntonic comma, 509 
system 

formation, 505 

of linear equations, 27 

time, 491 


T 


Table Mountain, 47 
take-off velocity, 162 
tangent, 131, 240, 269, 271, 
337, 348, 383 
definition, 258, 369 
inclination, 258, 270, 367 
limit, 258 
of a curve, 348 
of a sinusoid, 264 
of inflection, 373 
tangent surface, 358 
of a space curve, 354, 388 
tank with compressed air, 419 
target 
function, 374 
point, 304 
Taylor expansion, 528 
Taylor, Brook, 381 
teardrop shape, 352 
tearing edge, 81 
tectonic plates, 51 
teleconverter, 72 
temperament control, 507 
temperature 
compensation, 83 
scale, 24 
tempered 
duodecatonic, 506 
tones, 510 
terminator, 189, 321 
tessellation, 354, 360 
Tetra Pak, 101 
tetrahedron, 186, 231, 526 
the tree of Pythagoras, 124 
Theorem of 
angle of circumference, 


151 
Pythagoras, 68, 119, 176 
Pythagoras, spatial, 204 
right angles, 188-190, 203 
Thales, 122 
thermo oil, 343 
third, 505 
Thompson’s problem, 493 
three-door problem, 478 
three-phase current, 134 
three-sided glass prism, 170 
throw, 142 
thrust, 180 


thumb width, 126 
tidal bulge, 186 
time, 8, 9, 224 
meter, 512 
zone, 171 
timing, 513 
tiny mice, 86 
tomato juice, 465 
tonal value, 506 
tone row, 502 
torque, 206, 217 
compensation, 187 
Torricelli, Evangelista, 174 
torsion, 338 
tortoise, 369 
torus, 325, 345, 351, 407, 416, 
539 
spindle, 126 
total 
reflection, 165 
resistance, 19, 27 
toy animal, 72 
trachea, 83, 108 
track speed, 91, 92 


traffic 
fatalities, 442 
light, 475 
train, 5, 46, 154 
trajectory, 163 
transatlantic flight, 171 


transcendental 
equation, 23 


function, 381 
number, 527 
translation, 298, 542 
vector, 177, 305 
transposing, 509 
traps of reasoning, 475 
triangle 
geometry, 172 
Indian, 121 
oblique, 119 
right, 119 
scalene, 145 
triangulate, 68, 205, 218 


trigonometric function, 119, 
. 266 

trigonometry, 119 

tripod, 313 


trochoid, 300, 301, 346, 542 
trochoidal motion, 300 


true 
full moon, 422 
solar time, 224, 225 
time, 171 

truncated pyramid, 136, 186, 


tsunami, 107 

tubular chimes, 517 
tumor, 458 

Tutanchamun, 74 

Tutmose III, 523 
two-dimensional being, 324 


U 


Ulam, Stanislaw, 1 
ultraviolet rays, 166 
umbra, 160 
uneven tone, 510 
unfair offer, 252 
unfolding 
of a cylinder of rev., 353 
uniform motion, 38 
unit 
circle, 131 
of measurement, 9 
parabola, 242 
vector, 197 
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universal, 351 
universal joint, 219, 275, 303, 


352 
unstable form, 434 
upper fifth, 508 
upward throw, 38 
urn, 450 


V 
V-belt, 136 
variable, 14 
variance, 372, 445 
vector, 176, 312 
addition, 176, 298, 536 
analysis, 175 
equation, 184, 207 
field, 436 
normal, 188, 192, 537 
subtraction, 536 
unit, 197 
zero, 177 
velocity, 8, 9, 178, 295 
diagram, 140 
vector, 179 
vertex, 335 
tangent, 343 
Vienna, 109, 224, 361, 377 
Viganella, 228 
vignetting, 73 
Vikings, 364 
violet, 166 
virtual object, 209 
viscous force, 116 
Vitruvian Man, 62 
vocabulary test, 482 
voltage difference, 135 
volume 
element, 74 
function, 429 
of a beam connection, 427 


f a cone, 404 

a convex body, 205 
a cylinder of rev., 405 
a drinking glass, 412 
a frustrum, 411 

f a solid of rev., 408 

f a sphere, 408 

a spherical cap, 411 
f a tetrahedron, 205 

a torus, 406, 408 

f the Moon, 76 
vortex, 179 

voxel, 74 


W 


waistline, 202 

Wallis, John, 527 

waning moon, 422 

Wankel engine, 345 

warm-blooded, 99, 257 
animal, 83 

wasp, 82 

water, 50 
buffalo, 88 
column, 254 
pressure, 254 
resistance, 114 
strider, 80 

water flea, 117 

wear of a car tire, 12 

weather satellite, 94 

web addresses, 4 

Wegener, Alfred, 51 

weight of an antelope’s horn, 
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428 
weightlessness, 61 
Werckmeister, Andreas, 510 
western sound system, 505 
whale, 80 
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whirligig beetle, 89 
whole tone, 504 
scale, 505 
wide-angle lens, 55, 385 
wing 
motion, 179 
surface, 81 
twisting, 180 
wing-span, 81 
Witch of Agnesi, 269 
witch’s 
one-time-one, 520 
wobbler, 388 
wolf tone, 510 
worm gears, 305 


Xx 


a-axis (axis of abscissas), 438 
xylophone, 517 


Y 


Y-connection, 32 
y-axis (axis of ordinates), 438 
yen loans, 252 


Z 


z-axis (rotation), 302 
zenith, 225 
Zeno’s paradox of the tortoise 
and Achilles, 369 
zero, 281 
vector, 177 
zodiac, 232 
zoom factor, 72 
zooplankton, 107 


